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Transmission Characteristics of a Three- 
Conductor Coaxial Transmission Line 
with Transpositions 


By G. RAISBECK and J. M. MANLEY 
(Manuscript received October 30, 1957) 


This paper describes in detail the manufacture of, measurements on, and 
computations about a three-conductor coaxial cable with transpositions. By 
reducing skin-effect losses in the central conductor, this three-conductor line 
achieves 17 per cent lower attenuation at a particular frequency than does a 
two-conductor line of the same outer diameter. The matrix method used for 
analysis can easily be extended to n-conductor lines. Suggestions for making 
improved three-conductor coaxial lines are made. 


I. SUMMARY 


Most of the loss in a simple coaxial cable is due to series resistance in 
the center conductor. At high frequencies this resistance is aggravated 
by the skin effect, which causes conduction currents to concentrate in a 
thin layer on the surface of the conductor. Because of the skin effect, the 
effective cross-sectional area of a conductor is smaller than its geometric 
cross section at high frequencies. 

The effective cross section of the center conductor of a new coaxial 
line has been increased by dividing the conductor into two parts, a solid 
central member and a concentric thin shell insulated from the center 
member. Over a certain range of frequencies, conduction currents are 
distributed throughout the whole thickness of the thin cylindrical shell 
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and a layer one skin-depth thick on the surface of the solid center mem- 
ber. | 

The total current in the composite center conductor is divided into 
two components, one flowing in the solid center member and one flowing 
in the thin cylindrical shell. The effective resistance of the composite 
center conductor is less than that of a solid conductor only when these 
two current components are approximately equal both in phase and in 
magnitude, and then only over a limited frequency range. The desired 
division of current has been achieved by electrical transposition of the 
two parts of the inner conductor at intervals of less than one-eighth of a 
wavelength. 

Several thousand feet of such a three-conductor transmission line have 
been constructed with transpositions at various intervals. Detailed meas- 
urements and calculations have been made, and the three-conductor 
cable has been compared directly with a similarly constructed conven- 
tional two-conductor cable. The three-conductor cable has lower at- 
tenuation than the two-conductor cable at frequencies between 1 and 
10 me with transpositions 9 feet apart or less. Near 4 mc, the three- 
conductor cable has 17 per cent less attenuation than does the two-con- 
ductor cable. The agreement between measurements and computations 
is excellent. The improvement in the three-conductor cable can be in- 
creased to 25 per cent by changing the relative dimensions of the parts 
of the center conductor, and the frequency range over which it sur- 
passes a two-conductor cable can be increased thereby to 20:1. 


II. BACKGROUND 


2.1 Introduction 


A transmission line with three parallel conductors can support two 
modes of transmission at low frequency.’ Each mode has a well-defined 
phase velocity and attenuation at every frequency. It might be assumed 
that the transmission losses on such a line are minimized if power is 
transmitted only in the mode having the lower attenuation. If the line 
is sufficiently long, this is true; but for short sections of line it is not. The 
losses in a short section of line may be reduced, under some circum- 
stances, by a judicious combination of pure modes. 

The reason why the losses may be smaller for a combination of pure 
modes than for any of the modes separately depends on constructive 
and destructive interference. If two pure modes propagate with different 
wave velocities, the currents which they induce in the conductors will 
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be successively in phase, out of phase, in phase again, and so on. When 
the currents are in phase, the losses are greater than when the currents 
are out of phase. Over a long length of line, the phases will pass through 
many cycles, and the amount of power lost will average out to be the 
same as if the pure modes were propagated independently. 

If two modes are propagated over a short length of line so that there 
is substantial cancellation of currents, the losses will be reduced so long 
as the currents do cancel substantially. When the differential phase shift 
between modes begins to be appreciable, it will be necessary to terminate 
the line, draw off the power and launch it in a new length of line. If this 
is not done, the reduction in losses will be offset precisely by an increase in 
losses when the currents of the two modes become in phase. 

This phenomenon can be used to combat losses due to skin effect in a 
coaxial line. In a simple coaxial line, most of the effective resistance of 
the line is in the center conductor. This effective resistance can be re- 
duced if the center conductor is replaced by a smaller center conductor 
surrounded by a thin coaxial tubular conductor, provided the magnitudes 
and phases of the currents in these two inner conductors bear the right 
relation. As will be shown, this can be accomplished by transposing the 
two parts of the center conductor, which is equivalent to the process 
described in the previous paragraph. 

The problem of losses in a three-conductor transmission line has much 
in common with that of losses in a two-conductor line that has standing 
waves. The three-conductor line has four waves, one forward and one 
backward in each distinct mode of propagation, and these four waves 
may beat in various forms of destructive and constructive interference. 
In order to get an intuitive grasp of the performance of such a line, it is 
convenient to subdivide the problem into two parts: finding the loss in 
the line for a given distribution of currents, and determining how the 
current distribution which yields the lowest losses for a given power 
transfer can be attained. The answer to the first question tells how much 
a coaxial line can be improved; the answer to the second tells how to do 
it. 

2.2 Losses in a Three-Conductor Coaxial Having a Given Current Distribu- 
tion 

In the next section an accurate calculation of properties of a three- 
conductor coaxial will be made. But a good idea of how much the loss 
may be reduced in this kind of line, under the proper conditions, may be 
obtained by assuming a simple model for the line, as shown in Fig. 1, 
and also assuming that the currents in the three conductors are known. 
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I, =-(I2+1)) 





Fig. 1 — Three-conductor coaxial transmission line: cross section and sche- 
matic longitudinal section showing surface resistance. 


The loss in such a line is given approximately by 
L=3|h PR,” 
+ Bl] Li PR? + (ils + His)Ro®” + | Is [Ro] 
+ 4$[Is|’Raa”, 


where J; and J; are the current amplitudes in the inner and outer con- 
ductors respectively and the resistances are the surface resistances cal- 
culated from formulas (83) in Schelkunoff’s paper” on cylindrical trans- 
mission lines.* The superscripts in parentheses refer to the particular 
conductor being considered; Raa is the resistance of the inner surface of 
a tube per unit length to current which is returned entirely inside the 
surface, Ry is the resistance of an outer surface of a tube per unit length 
to current which is returned entirely outside the surface and R,y is the 
transfer resistance from one surface of a tube to the other. 

Of the five terms in the equation above, the first is the ohmic loss in 
the center conductor. The second, third and fourth terms are the loss in 
the intermediate conductor. The resistances Raa, Rav and Ry are real 
functions. 

It is obvious that only one of the five loss terms depends on the rela- 
tive phases of J; and J; . This is the third term, which contains the factor 


(ils = IIs) = 2 I, | | Is | cos 6, 


* The reader should note that the first-order correction terms in Schelkunoff’s 
formulas (82) are in error; formulas (75), from which they were derived, are cor- 
rect. Briefly, Zaa , Zap, Zsa and Zpp are the ratios of longitudinal electric field 
strength on the inner or the outer surface of a tubular conductor to the part of 
the total current returning either inside or outside the tube. The real parts are 
denoted by & with the same subscripts. For details, see Ref. 2, pp. 554-558. 
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where @ is the phase difference between J; and J; .. Any reduction in losses 
depends on having RF,» large and choosing J; and /3 so that the term con- 
taining Ra, is negative. 

These theoretical calculations show that the losses in a coaxial trans- 
mission line can be reduced by using a third conductor. Suppose, for 
example, that the skin depth, the thickness of the inner dielectric layer 
and the thickness of the intermediate conductor, are all small relative to 
the radius of the center conductor. Then the various loss formulas are 
simplified and it is easy to compare the losses in the two innermost con- 
ductors with the loss in a solid conductor of the same over-all dimen- 
sions. This comparison is made in Tig. 2. Here, the dotted and dashed 
curves show the resistances of the two conductors compared to the re- 
sistance of a single solid conductor. or the dotted curve, the distribution 
of currents is chosen to minimize the resistance. The required distribu- 
tion is shown in the solid curve, which gives the best ratio J3/I,. The 
dashed curve shows the relative resistance if the current is divided 
equally in the two inner conductors. In either case, the resistance may 
be reduced to 65 per cent of that of a solid wire for a particular frequency. 

In a normal coaxial line 78 per cent of the effective resistance is in the 


RATIO I,/I3 FOR WHICH LOSS IN 
CENTER CONDUCTOR 1S MINIMUM 


mo--- LOSS RELATIVE TO A SOLID 
CENTER CONDUCTOR OF THE 
SAME OUTER DIAMETER 


—— Loss iF I,/I3= 0.5 
7 
























































ie) 0.4 0.8 1.2 1.6 24 2.8 3.2 3.6 4.0 


2.0 
u=2t/o 


Fig. 2— Comparison of losses in simple and compound center conductors for 
certain current distributions. 
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center conductor and 22 per cent is in the outer conductor. When the 
center conductor resistance is reduced 35 per cent, the attenuation is 
reduced not 35 per cent but roughly 78 per cent X 35 per cent = 27 
per cent. 

From the curves of Fig. 2 we can conclude that, if the current in a 
suitably proportioned three-conductor coaxial line can be distributed so 
that the two innermost conductors carry equal currents, at one par- 
ticular frequency the attenuation can be reduced to 73 per cent of the 
attenuation of a two-conductor coaxial line of the same over-all dimen- 
sions and made of the same materials. The attenuation of the three- 
conductor line will be less than that of the two-conductor line over a 
20:1 range of frequencies, but at extremely low or extremely high fre- 
quencies it will have higher attenuation. 


2.3 How Can the Desired Current Distribution Be Achieved? 


The previous section shows that a coaxial transmission line having 
three conductors, the two innermost of which are very close together, 
will have lower losses than a conventional coaxial cable if the currents 
in the two innermost conductors are approximately equal. Two inde- 
pendent ways to achieve this equal division of current have been sug- 
gested. One way* is to alter the dimensions and dielectric constants so 
that the line has a natural mode of propagation with approximately the 
desired current distribution. The other proposal is to transpose conduc- 
tors in the manner of a litz wire to get the desired distribution by brute 
force, so to speak. This paper describes computations and experiments 
with a line having such transpositions. 


2.4 History 


The idea of transposition between a solid cylindrical conductor and a 
concentric tube is at least as old as U. 8. Patent 1,088,902, issued on 
March 38, 1914, to P. V. Hunter. Fig. 4 of that patent shows a transposi- 
tion similar to Fig. 15 of the present text, and the exact wording of the 
specification states: 

“Tf it is desired to arrange the feeders so that the leads have equal im- 
pedance, the cable may be divided into an even number of equal lengths, 
the inner conductor a of one length being, as indicated in Fig. 4 [of the 
patent] connected to the other conductor b of the adjacent length. This ar- 
rangement also prevents alteration to the division of current between the two 
leads by unequal inductive effects on the two leads of currents in adjacent 
feeders or earth.” 


* Proposed by H. 8. Black and 8. P. Morgan, Jr. 
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Hunter was concerned with power distribution systems. The increase 
in resistance at high frequency resulting from the skin effect is not men- 
tioned in his patent. This is reasonable, for at a frequency of 50 cycles 
per second the increase in resistance of a copper conductor even as large 
as one-half inch in diameter is only 0.34 per cent. One can easily under- 
stand, therefore, why Hunter did not apply his technique to the reduc- 
tion of skin resistance. 

Later, when the nonuniformity of current distribution at high fre- 
quencies became a matter of concern, stranded conductors’ were used 
rather than transposed coaxial cylinders. At frequencies below one mega- 
cycle per second, stranded conductors may be quite advantageous. 
Above that frequency, the fineness and large number of the strands re- 
quired makes the technique less rewarding. 

The work reported in this paper stems from a systematic search for 
ways to combat skin and proximity effects. The beginning of this search 
was stimulated largely by W. H. Doherty during his tenure as Director 
of Research—Electrical Communications at Bell Telephone Laborato- 
ries. 


2.5 Early Experimental Results 


The first three-conductor coaxial line with transpositions was made 
with a center conductor devised by W. McMahon and fabricated by 
him and G. R. Johnson as shown in Fig. 3. Starting with a bare wire 
30 mils in diameter, flat sections 8 in. long, 120 mils wide and 5 mils 
thick were formed at regular intervals along the wire so that they were 
separated by a length slightly greater than 8 in. This was done by placing 
the wire on a hardened steel plate and passing the plate and wire through 
a rolling mill. The flat sections were then formed into channels in a 
simple die. Then the round sections of the wire were insulated with a 
wrapping of plastic tape. Finally, two such conductors were placed side 
by side so that the flat sections (now channels) of one were matched 
with the round sections of the other, as shown in the figure. The round 
sections were laid in the channels, and the whole structure was passed 
through a die of diameter such that it caused the flat sections to be 
wrapped completely around the insulated round sections. Thus, the con- 
ductors alternate as inner and outer conductors along the line, as shown 
in the figure. 

Resistances measured at various frequencies were compared with 
computations based on the approximate formulas of Section 2.2, which 
were, at that time, the best available. The results are shown in Figs. 4, 
5 and 6. While the correspondence between the measured and computed 
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Fig. 3 — Method suggested by W. McMahon for making a transposed com- 
pound conductor. 
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Fig. 4 — Effective resistance of a 1-meter length of coaxial line with a trans- 
_ posed copper center conductor. 
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Fig. 5 — liffective resistance of a 1-meter length of coaxial line with a trans- 
posed copper center conductor of different size. 
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Fig. 6 — Effective resistance of a 1-meter length of coaxial line with a trans- 
posed aluminum center conductor. 
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values is not perfect, it is clear that the computed curves are a good 
qualitative approximation to the measured values. 

At that time, it was decided to make a long test sample for further 
measurement. The experimental length was chosen so that: (1) different 
transposition intervals could be studied, (2) attenuation could be meas- 
ured directly in a straight-through transmission measurement, (3) termi- 
~ nation effects would be small in comparison to transmission effects and 
(4) a control experiment on a two-conductor coaxial line could be easily 
carried out. As this cable was being fabricated and measured, a detailed 
analysis and computation procedure was worked out. Actually, the 
measurements were completed six months before the computations. The 
results of this last experiment and computation are the main subject of 


this paper. 


2.6 Plan of the Paper 


Following this introduction, we shall make a thorough analysis of the 
three-conductor transposed line. In Section III, the propagation prop- 
erties of a uniform three-conductor line are expressed in matrix form. In 
Section IV, a unit of transposed line, made by combining two equal 
lengths of this uniform line and a transposition, is described. A cascade 
of these units is then analyzed as a periodic transmission medium to find 
its new traveling waves and propagation constants. In general, the ex- 
pressions are complicated and numerical analysis is necessary to obtain 
useful results. However, if the spacing between transpositions becomes 
very small, quite tractable forms appear. Further, an approximation to 
these forms yields a very simple understandable picture of the behavior 
of these lines. Following this, Section V is a description of experimental 
work and final results. Details of calculation and tables of. results are 
given in Section VI. 


III. PROPAGATION IN UNIFORM THREE-CONDUCTOR COAXIAL LINES 


Consider the line with transpositions to be made up of a large number 
of equivalent sections each containing a certain length 2/ of transmission 
line and one transposition. For the sake of preserving symmetry, it is 
convenient to place the transposition in the center, flanked by two equal 
lengths / of line, as in Fig. 7. 

It is convenient to use matrix notation to describe the transformation 
of voltages and currents from one end of each of these elements to the 
other because the matrix of the transformation of any number of linear 
networks in cascade is simply the product of the matrices of the individ- 
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—--— ONE SECTION OF LINE WITH TRANSPOSITION IN CENTER-————> 





| 
3 th | 
es eee 
SEVERAL LIKE SECTIONS IN TANDEM 


Fig. 7 — Schematic representation of a three-conductor transmission line with 
several transpositions. 


ual transformations, even though the individual networks are not iden- 
tical (Ref. 6, p. 337). 

First, we wil obtain a suitable mate for the length J of uniform line, 
starting from the differential equations. Carson and Gilbert’ and Schel: 
kunoff’ have shown how to derive a first integral of Maxwell’s equations 
for a structure consisting of uniform concentric cylinders and shells. For 
the purpose of studying three-conductor coaxial lines it is only necessary 
to adapt Schelkunoff’s general results to the specific case at hand. 

Suppose the cross section of the transmission line is as shown in I'ig. 1. 
Let the conductors be numbered, from the innermost out, 1, 2 and 3. 
Let the currents in the several conductots be ie", “Gigi! ad i3¢7"". 
Note that small letters are used instead of the capital letters used before, 
and that the dependence on longitudinal distance x is included. Let the 
voltages of the various conductors be v,e?"’, v2e7"' and v;e""". To avoid 
ambiguity, we may suppose that voltages are measured radially inward 
from the outermost conductor. It follows that 73 = —7 — 27 and v3; = 0, 
and we can eliminate 7; and v3; when desirable. 

Following Schelkunoff, in analogy to his equation (52), we can find 


eH tog = B (a3) — Hal) — 2 ( — w), 





or by .. 
(tag dy F) me 
iat 5 2 log 5° = E,(a,) — Babs) — = (0, — wy), 
Tv 2 Ox 
‘s i Be Ou 
2—- y= aa’ Ee EK 
~~ On(g : Flee ax’ 
1 7) @) 
i a 2r(g + jwe) log bo 02 (a +m). 
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Here a, and b, are the inside and outside radii, respectively, of conduct- 
ing shell n, and /,(a,), £,(b,) are the longitudinal electric field strengths 
at the inner and outer surfaces of the nth conductor. 

To these equations must be added Schelkunoff’s equation (74) (modi- 
fied appropriately in notation) for determining the surface electric fields 
E,: . 

E,(dn) = Vas Pas + ie an Pao ’ 
3 
E (bn) = Fie Lae + Vrs Pan 


where J,‘” and J;‘” are the parts of 7, which flow back inside and out- 
side the nth conductor and the Z’s are the artificial surface impedances,* 
the resistances of which were used above. We see from Fig. 1 that 


I” +1 = tn, 
n—1 
I = —>o tn, 
1 


3 n 
) » ° 3 . (n+1) 
I,” =a lin = in = —I,” ’ 
1 


n+1 


(4) 


I, = 0. 


The terms on the left of the first two equations (1) above are the in- 
ductive reactance voltages resulting from the magnetic field between 
conductors. To simplify the writing, let Xa, and Xa2 be these reactances 
so that 


We a2 
3, 8 5, — Xai; 
(5) 


(Oy 0 a3 . 
a log i Xa2. 
In the second pair of equations (2), voltages and currents are related 
by the radial admittances Y; and Y» of the dielectric between conductors. 
That is, 


2r(g + jwe) _-y 
log (a/b1) ae (8) 
Qr(g + jwe) _y 
: log(a3/b2) = 


* Formulae for the surface impedances for the cylindrical line are given in Sec- 
tion VI. 
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Equations (1) and (2) may now be written 
Ot 
Ox 
Ot 
Ox 


= — Yin + (Y; “ Yo)ve, 
(7) 


du 
Ox 
v2 
Ox 


where 


(Zoo + Zu — 2Za)ir + (Zo — Zar)ia, 


m= (Zoo _ Zan) t1 Zo 12, 


Zu = Ze? + Zea” + jXar (8) 


is the effective series impedance of the simple coaxial line consisting of 
conductors 1 and 2 only, and where 


Log = Zo + Loa + jXe2 9) 
is the effective series impedance of the simple coaxial line consisting of 
conductors 2 and 3 only. The coefficients in these equations are all per 


unit length of line. 
In matrix notation (7) is 


ais 

Ox 
dis fe oy 
Ox 19(2) 

= A’ | (10) 

Ov1 Vy (x) 
peed, ; 
cn v(x) 
Ove 

Ox 


This system has four eigenvalues, which are the roots of the charac- 
teristic equation of the matrix (Ref. 6, p. 314; Ref. 7, p. 111): 


determinant [A’ — yZ] = 0. 
To each eigenvalue y;(7 = 1, 2, 3, 4) corresponds an eigenvector, 


7;1(z) 


Ba 5 152(x) pe dees a) 
Vi = va (x) ? J = 1, 4,9, ; 


vjo(x) 
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such that 
tj al 
, Lj2 iio 7 
A = 4; : 7 = 1, 2, 3,4, (12) 
vj Un 
Vj2 Vj2 


where the notations 


V(z)] = bead and V@) = [ (2), v(x) | 


are used for column and row matrices respectively. 

The following convention regarding subscripts will be used in the rest 
of this paper: If a voltage or a current bears two subscripts, the first re- 
fers to the number of the eigenvector or mode, and the second to the 
number of the conductor. 


3.1 Traveling Waves and Eigenvectors 


A set of certain simple solutions of the system (10) can be expressed in 
the form 


in(a + £) = Mi(x) = e* (x), 
ix(t +.£) = Na(x) = e% in(2), 
: (13) 
v(x + £) = dvi (x) = e” v,(z), 
vo(2 + £) = An(x) = e% v2(z). 


To a transmission engineer, each such solution is a traveling wave with 
a propagation constant y. To a mathematician, each such solution is an 
eigenvector with an eigenvalue \. More general solutions can be expressed 
as linear combinations of such elementary solutions. In this particular 
case, as we saw above, there are four traveling waves or eigenvectors, 
and the solutions are 


41(2) = 1 e7* i Ine” oe tsie°* ar ine*, 
a(t) = tye" + toe?” + dgge™ + de®, 
01(2) = Un er oF 67" ++ V31 et =“ vn e*, 


v(x) = vye™™ + vee” + vyge™™” + vye™*”. 


(14) 


The eigenvalues y; , corresponding to the 7 normal traveling waves of 
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the system described by A’, are isolated and shown clearly by factoring 
A’ into the canonical form (Ref. 6, p. 314; Ref. 7, p. 111): 


A’ = LTL", (15) 
where _ 
1 0 0 0 
eee . 
O 
and 


| daa don U1 tar | 
212 122 132 142 = Ee le cae 
Vir Var U31 “i Vi] V2] V3)V4) , 


Vy12 Vee Usa V4 


an 


Because A’ has special properties, in particular, 


4 
Al = (18) 
Z 0 


the set of four equations (10) can be reduced by further differentiation 
to two separate sets of two equations: 


any [on 
Ox? 14 02? V1 
Pa Gee 24 : se pa ee ; (19) 
0 te to OV2 V2 

Ox? Ox? 

Because YZ is the transpose of ZY, the eigenvalues of these two sys- 


tems are the same. Further, because the solutions have the forms (13) 
and must satisfy a relation like (12), we have 


YZ "| ae a (20) 
i i 


Since the eigenvalues y:° and 3" of (19) are the squares of those of (10), 
we have ; ee ee ae 





Y1 = 7Y72; ys = TY74- (21) | 
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They may be calculated from 
determinant [YZ — y’J] = 0, (22) 


as indicated above. 

The ratios of magnitudes of the several traveling waves in the two 
conductors may be calculated from (20) and a similar equation for the 
voltages after the y’s have been found. 

It may be noticed that if Z., = 0, we have y’ = YiZy or YoZ, cor- 
responding to two independent coaxial lines. This would be the case if 
the intermediate conductor tube were very thick or the frequency very 
high. 

The above results may be extended to any number of concentric tubes. 
For each added tube, two more equations are added to the original set 
of four, and one additional distinct natural mode which may travel for- 
ward or backward appears. The number of homogeneous equations [cor- 
responding to (20)] is one less than the number of conductors. 

The computation for the three-conductor case we considered was done 
by machine. The results will be described in Sections V and VI. 


IV. TRANSMISSION PROPERTIES OF LINES WITH TRANSPOSITIONS 


In order to combine sections of uniform line and transpositions as in- 
dicated at the beginning of Section III, we need the relation between 
currents and voltages into and out of a line section of length J. This is 


ade + l) 11 (x) 
ix(x + I) 42(x) 
=A (23) 
v(x + 1) v1 (x) 
| vox + 1) v2(x) 
The matrix A is related to the matrix A’ discussed in Section III by 
A = LAL", (24) 
where* A = exp (TZ) 


is a diagonal matrix with elements \; = e”*’. This throws into clear view 
the relation between the differential equations of the line and the trans- 


* The form (23) is derived from that of (10) by considering n sections of length 
Az in cascade, calculating 7(ao + nAz) from [i(ro + nAz) — 7(20)J/Az, and then 
keeping nAz = | while Az — 0. This is a case of Sylvester’s theorem. A treatment 
is found in Ref. 7, p. 119, where polynomial functions are considered. A general 
proof for converging power series, including exponential series as a special case, 
follows by taking the limit as the sum of a polynomial series. 
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Fig. 8 — Schematic representation of a transposition in a three-conductor 
transmission line. 


mission properties of a finite length. The eigenvectors are the same, and 
the eigenvalues are related through the equation 


NG = et. (25) 


where A; = 1/2 and A3 = 1/4 correspond to (21). 
The transposition shown in Fig. 8, whose matrix we shall call 7, is 
governed by the equations 


/ 


vy = Ve, 
as / 
a ene (26) 
a ay 
Uy =k, 
“ 4 
ga =U, 


where the primed and unprimed letters refer respectively to the two sets 
of terminals of the six-terminal network. In matrix notation, 


01 0 0 
100 0 
T = (27) 
000 1 
00 1 0 


The line with transpositions is made up of a large number of equivalent 
sections, each containing a certain length 2/ of transmission line and one 
transposition at the center, as in Fig. 7. The matrix of the transforma- 
tion becomes 


ATA = LAL 'TLAL™. 


Remember that A depends on the length of the section of line, although 
DL, Le and T do not. 
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The characteristic equation becomes 
det [LAL 'TLAL' — v1] = 0. (28) 


The matrix can be modified without changing its determinant by multi- 
plying on the left by L' and on the right by L, to get 
det [AL 'TLA — vI] = 0. (29) 


Inasmuch as the matrices Z and Y [see (7)] are both symmetrical, ZY 
is the transpose of YZ. Consequently, there is a set of orthogonality re- 
lations among the respective eigenvectors, as follows: 


IjVul = tyir + tj2de2 = 0 


if 
vi A skye or Apt 1/AjAM+ 1/\ 
and 
IVi] + Vik} = 0 
if 


Yi A%ve Or Ny Are. 


The use of these relations simplifies the calculation of L7'7L con- 
siderably. The first step is to find L™: 





Pvu pivie- Pit Prtie pV pir 

= P21 P2Ve2 Dolor Dolo p2Vo Pol 2 
iS Cg Se es (30) 

D331 P3ds2 Dsts1 Dsts2 PsVs Pals 

PaVa1 P4V42 pata Datao paVa pal 

where 
— (31) 
a BL 


Using, in addition, the relations (21) and consequences, straightfor- 
ward but tedious multiplication yields for the characteristic equation 
(29) 


vi — Qi + As” — Ag’ — AL) 
+ ¥[2Qr — (Qe + Qe’) (AeA + ALAS) 
— (QP — Qs)(g As’ + APAL)] 
~ QQ +A’ — As — MP) +1 = 


(32) 
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where 


Qi (V12V32 — Vis1)/A, 
Q2 = [WV pi/psQou’ — v2") + VW ps/pilvse’ — vs1°)]/2A, 


Qs = [|v pi/ psu = Ve’) — V?P3/ Di (Use a vs')]/ 2A, 
A = VyV32 — 042031 « 


Equation (31) defines p; and 73, and 
Qf + Q - QF = 1. 

As anticipated, the equation has symmetrical coefficients, which means 
that the roots occur in symmetrical pairs, and also means that a substi- 
tution vy + 1/v = £ reduces the degree to two. The roots v; of the charac- 
teristic equation (32) are the eigenvalues of the system and are related 
to the effective propagation constants yr; of a transposed line by v; = 
e’T;’, By this means, the propagation constants of the transposed line 
were derived from the eigenvectors and propagation constants of a sec- 
tion without transposition, with the aid of matrix properties of the line. 
The actual numerical computation was performed on a large electronic 
computer for several different transposition intervals, and the results 
are given both graphically and in tabular form in Sections V and VI. 


4.1 Infinitesimal Spacing of Transpositions 


A special case of the general results just given yields results with much 
less numerical calculation and throws interesting light on the whole sub- 
ject. From it, a very simple picture of the action of transposed lines is 
derived. 

Consider the situation when the uniform line sections on either side of 
a transposition have length dx instead of /. The matrix A now may be 
formed from A’ by calculating 7(2 + dz), ete., from dz:/dz. In this way 
we get 


1 0 Y, dx —Y, dx 
i 0 1 —Yidx (Y¥i:+ Y2) ‘J 
(Zo + Zu — 2Za) du (Zo2 — Zar) dx 1 0 
(Zon — Zan) de Zon dt 0 1 


From this, the matrix A7'A for an element of transposed line is ob- 
tained easily. But, for present purposes, it is desirable to maintain the 
positions of 2(a), to(x), ete., in the column matrices of (23), and so we 
will.consider a unit of length 4 dz having a matrix ATAATA. 
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Just as was found in Section III, it is easier to handle a system 
like (10) with matrix A’ when the line is divided into equivalent. sec- 
tions of infinitesimal length. To do this, 4077/02 is calculated from 
[tm(x + 4dx) — tm(x)]|/dzx, etc. The transposed line composed of these 
very short elements is described, as in (10), by 


alr] 


“Ox I "] 

a Ae , (33) 
dV rl Vel 
Ox 


where 
0 Yr 2Y, + Y,/2 -. Y; 
Ar = ’ Yr = ’ 
| Zr O —YV, 2Y,+ Y./2 


[’- aA ra e/a 


and 


= 


222 = Lab Zo a 2/2 ? 4 Zab 


The eigenvalues yr; of (33) occur in pairs and are calculated from de- 
terminant [YrZr — yr/] = 0. The eigenvectors 77;] and V,r,] are cal- 


culated from 
Fee as or 
VYolLZr hal = rj bea , etc., (34) 
trj2 trj2 


as before. The results are 
(Low loss mode) 
yn = Y(Z2— Zao+ Zu/4), ve = —In; 
ty = ty = vn Y2/2ym = veY2/2yn, 


toy = dog = Vy Vo/Qyr2 = U2V2/2y72; (35) 
(High loss mode) 

yr = (Vit ¥2/4)Zu, 9 yr = —¥28, 

13, = —t32 = 03(4V1 + Vo)/2y7r3 = —v30(4V1 + Vo) /2y7s , 

ty = tg = On(4V + Vo)/2yr4 = —02(4A¥i1 + Yo)/2yr: , 


where the 7 subscript on the voltage and current components has been 
dropped for simplicity. 
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It is clear in this case that one mode has the expected current distri- 
bution, 1.¢., 21. = ?2, in which equal currents flow in the two inner con- 
ductors. This confirms the heuristic analysis in an earlier section, where 
we assumed that such a current distribution could be achieved and con- 
jectured that it could be achieved by transposition at sufficiently short 
intervals. A detailed comparison with the estimate of loss Z in Section 
2.2 shows that the predicted loss is the same as estimated before. How- 
ever, the present analysis is more revealing, because it not only tells the 
loss of the desired mode, but also shows the losses and current and 
voltage distributions of both modes. A knowledge of the current and 
voltage distributions is necessary to launch one mode without exciting 
the other. In this case, connecting the two inner conductors together, so 
that v, = v2 at the launching point, is enough to guarantee that the un- 
wanted mode cannot be excited, for in the unwanted mode v3, — v32 does 
not equal zero. 


4.2 Simple Picture of Attenuation in the Transposed Line 


An interesting picture of the operation of the transposed three-con- 
ductor line can be obtained from (35) for yr’ when the transposition 
interval is very small, (ym being the propagation constant for the low- 
loss mode). In this expression, Y2 = 7S: may be taken as the radial ad- 
mittance and Z22. — Za + Zu/4 as the series impedance of an equivalent 
two-conductor coaxial. If the series impedance is R + 7X and R is very 
much less than X, we have approximately 


R ——=—_ 
Yn = 9 : + iV 8X. 


The attenuation (r.p. yr) is proportional to the resistance of the series 
impedance, and we will consider only this: 


R = Reo» ~~ Rap a Riu/4 


RO 4 Rg 
4 4 ° 


Here, Raa” is the resistance in ohms per meter of length of the outer con- 
ductor; Ry and Ra.” are the surface resistances along the outer and 
inner surfaces of the intermediate tube; R., is the transfer resistance of 
this tube and Ry” is the surface resistance of the inner solid conductor. 

Because the intermediate tube is electrically thin for the frequencies 
of interest, Ry” and Rw» are quite flat as functions of frequency and 
very nearly cancel each other. Eventually, Ry” begins to increase and 
Ra» to decrease with frequency. Until this frequency is reached, the re- 


. = Rea’ + Ry” = Ras +e 
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sistance of the central conductor of the equivalent two-conductor coaxial 
is given very closely by 


Res = Roa + Tepe |: 


A physical picture of the hypothetical center conductor which agrees 
with this expression is shown in I'ig. 9 for a line n meters long. This has 
a real correspondence to the actual transposed line in that each of the 
two inner conductors consists of alternate sections of intermediate and 
inner conductors having the resistances per meter of the two terms above, 
and each carries approximately the same value of current. Of course, the 
transposition interval is not necessarily one meter, as indicated in the 
figure. Fig. 9 also shows a representation of the central solid conductor 
of the reference two-conductor coaxial. It is assumed here that the in- 
termediate tube and its inner insulation are very thin, so that their 
combined diameters are essentially the same as the diameter of the solid 
reference conductor. Its resistance per meter is then Ra = Ry”. 
Formulae for calculating the surface impedances are given in Section 
VI. The first terms of each of the approximations (38) are suitable for 


K Bi SS aah ae eae MiIMETERS rea ee ee > 
2 (1) 2) (1) (2) (1) 
| ae Rbb ae Rbb Raa Rbb | 
om am am oe a a W\y—— 
(1) (2) (1) (2) (1) (2) 
Rbb Raa Rbb Raa Rbb Raa 


CENTER CONDUCTOR OF 2-CONDUCTOR COAXIAL LINE 
EQUIVALENT TO 3-CONDUCTOR TRANSPOSED LINE 
Dp (2) nN pf) 

> Raat > Rp 


b 2 (1 
TOTAL RESISTANCE = 5 =f ee + Rob) 


_— 1 52) 1) 
RESISTANCE PER METER = ~ (Raa + Rbb) 


(1) (1) (1) (1) (1) (1) 
Rbb Rbb Rbb Rbb Rbb Rbb 
CENTER CONDUCTOR OF REFERENCE 2-CONDUCTOR COAXIAL 
(1) 
TOTAL RESISTANCE = NRbb 


(1) 
RESISTANCE PER METER = Rbb 


Fig. 9 — Low-frequency representation of inner and intermediate conductors 
in three-conductor transposed line. 
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present purposes and are reliable even at low frequencies for a thin- 
walled tube. Thus 








1 1 
FO 2 a Rea oe fae 
27 a2 ot 2ragt’ 
ae 
R (1)? ses Ble 
bb 2b, = 
using (37) also. If, in addition, be, az and b; are approximately equal, 
Rat: 228 
Ryn” ~ i? 
where 
= poe 
mugs 


is the skin depth and ¢ is the thickness of the intermediate conductor. 
Then, the ratio of center conductor resistance in the three-conductor 
transposed line to that of the two conductor reference line is 


Reg AV fé 
goat tes) 


At the frequency for which skin depth equals intermediate conductor 
thickness, the resistance is cut in half. If 78 per cent of the cable loss 
occurs in the center conductor, a 39 per cent reduction of attenuation 
would result. This is quite a bit more than was found experimentally. The 
difference is that, even for 6/t = 1, Ra does not cancel Ry” exactly. If 
the third terms in coth ot and csch ot (the second terms give reactance 
only) are added to the above simple approximations, we find 


Res 1/6 1/t\ 
Now, when 6/t is 1, K equals 0.667 and a 33 per cent reduction of re- 
sistance or 25.8 per cent reduction of attenuation is obtained. This is 
very close to the value computed from the whole expression for one-mil 
polyethylene (J’ig. 18). When the cubic term is appreciable, the simple 
picture of Fig. 9 is no longer sufficient. 

We find K equals one for 6/f = 3 and 6/t = 0.66. The first is the 
lower-frequency crossover point (0.54 mc) and thesecond is the higher cross- 
over point (11.1 mc) where the three-conductor transposed line has the 
same loss as the reference two-conductor line. These values agree quite 
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Fig. 10 — Cross sections of two-conductor and three-conductor coaxial lines 
used in experiment. 
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Fig. 11 — Photomicrograph of cross section of the inner conductors of the 
three-conductor coaxial line. 
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well with those calculated from the full expression for attenuation and 
plotted on Fig. 18. 

The above simple expression for center conductor resistance reduction 
thus gives a good description of three-conductor transposed line per- 
formance, provided the radii of the two conductors are nearly the same. 

If two thin shells surround an inner conductor to make a four-conduc- 
tor transposed line, the simple picture of Fig. 9 may be generalized to 


give 
Rea 1/6 
ie 2 = 


for low frequencies. At 6/t = 1, this would yield a 67 per cent resistance 
reduction or 52 per cent attenuation reduction. Naturally the cubic term 
is bigger than for the three-conductor line, and its effect would be felt 
at lower frequencies. 





V. EXPERIMENT AND COMPARISON WITH THEORY 


As mentioned above, it was predicted that there would be a reduction 
of attenuation in a coaxial line whose center conductor was laminated, 
with the laminations transposed periodically along the line. Confirmation 
of this prediction by resistance measurements of a three-foot length of 
such a line led to the decision to test the idea more fully by transmission 
measurements on a long section of line. 


5.1 Making the Cable 


This required the manufacture of about a mile of three-conductor co- 
axial cable of unusual design. A design and a method of fabrication were 
worked out in cooperation with members of the. Western Electric Com- 
pany. Fig. 10 shows a cross section of the finished cable and its dimen- 
sions. The inner conductor is 19-gauge copper wire over which poly- 
ethylene was extruded to a thickness of 5 mils. Using a special die, the 
intermediate conductor was made by forming a copper tape about 1 mil 
thick and about 150 mils wide over the insulated wire, leaving a longi- 
tudinal seam. Ordinarily such a seam must be held closed with a helical 
wrapping of some kind, but this was found unnecessary when the second 
and comparatively thick layer of polyethylene dielectric was extruded 
over this intermediate conductor immediately after it had been formed 
by the die. One factor contributing to the success of this operation was 
the leaving of a small gap in the seam rather than an overlap. Micro- 
scopic examination of cross sections (see Fig. 11) showed the gap to be 
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about 5 mils or less. Capacitance measurements also indicated the pres- 
ence of a slight air gap a few tenths of a mil thick between the inner and 
intermediate conductors. The final operation consisted of forming the 
outer conductor, using a copper tape 4 mils thick by 0.600 in. wide and a 
special forming tool. In this case, there is a small overlap at the longi- 
tudinal seam, which is held closed by a helical wrapping of paper tape, 
5 mils by 0.750 in., applied 20 wraps per foot of cable. The effective air 
gap between the intermediate and outer conductors is about 1 mil, as 
determined by capacitance measurements. 

At the same time this cable was made, a two-conductor coaxial cable 
was also made for comparison purposes. The central conductor of this 
cable is a solid copper wire of 0.0470-in. diameter, which is just a little 
less than the outside diameter of the intermediate conductor of the other 
cable. For both cables, the polyethylene dielectric was extruded through 
the same die and the outer conductor was made from the same batch of 
copper and applied in the same way. A cross section with dimensions for 
this cable is shown in Fig. 10. Sections of both cables are shown in Fig. 
12. 

The use of a reference cable made in the way described makes it un- 
necessary for the testing of the attenuation reduction idea to depend on 


0.048" D 19 GAUGE 
COPPER WIRE COPPER WIRE 





0.005" WALL POLYETHYLENE ————> «, 





<—_—————— 0,005" x 0.75" PAPER ——-—-_—_> 


TWO CONDUCTOR THREE CONDUCTOR 


Fig. 12 — Photographs of two-conductor and three-conductor coaxial lines 
used in experiment. 
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a comparison of measurement with calculation. Rather, it depends on 
the comparison of the measured attenuations of two similar real cables, 
one having a solid central conductor, the other having a laminated cen- 
tral conductor in which the laminations are transposed regularly. 

Open and short-circuit impedance measurements showed that the 
characteristic impedance of the three-conductor cable is very near 50 
ohms. According to the estimates of Section IT, the frequency at which 
maximum reduction of attenuation occurs is determined by the thick- 
ness of the intermediate conductor. For the cable described above, this 
is about 4 me. 

Because of the way in which they are made, the intermediate and 
outer conductors will buckle or even break if the cable is bent with too 
small a radius. To insure further against possible buckling or breaking, 
it was planned to handle the cable as little as possible. Hence, it was de- 
cided to re-reel the cable in a single layer onto suitable drums as the trans- 
positions were made, so that access to any part of the cable would be 
possible without additional unreeling and re-reeling. 


5.2 Making the First Transpositions 


After the cable was fabricated as described above, the major part of 
the laboratory work consisted of devising and making satisfactory trans- 
positions between the inner and intermediate conductors at regular in- 
tervals. At a point where a transposition was desired, the cable was cut 
and conductors and dielectric were stripped back so that all three con- 
ductors were laid bare at both sides of the cut. By means of special jacks 
and a plug, the center conductor on the left side of the cut was con- 
nected to the intermediate conductor on the right side of the cut and the 
intermediate conductor on the left side was connected to the center con- 
ductor on the right side. The two outer conductors were then connected 
together. Fig. 13 shows the cable ends connected to the jacks. 

Because of the fragile nature of the intermediate conductor and its 
supporting insulation, considerable care had to be exercised in making 
the transpositions. Ragged edges on the 1-mil copper had to be guarded 
against to avoid puncturing the 5-mil polyethylene. Damage to the insu- 
lation from excessive heat was avoided by using low-melting-point solder 
for making electrical connection to the intermediate conductor. 


5.3 First Measurements, Showing the Effect of Too Long a Transposition 
Interval 


At the two ends of the cable, the inner and intermediate conductors 
were connected together to serve as one input and output terminal. Fifty- 
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ohm terminations were used. The results of the transmission measure- 
ments are plotted on Fig. 14, along with those for the reference two- 
conductor coaxial. These were disappointing because, in the frequency 
region where a reduction of loss was expected, two large peaks of in- 
crease were found. Measurements made after changing all the patching 
plugs for straight-through connection gave results very nearly the same 
and very nearly as smooth as those obtained from a continuous piece of 
cable of the same length. Apparently the transpositions and not the dis- 
continuity at the section junctions were responsible for the peaks. With 
only one transposition at the center of the 1180-foot section, there were 
+0.2-db ripples in the loss curve. The effects of reflection at the uni- 
formly spaced transpositions accumulated, with the results already seen. 
Investigation showed the transposition interval to be one-quarter wave- 
length at the first peak and three-quarters of a wavelength at the second. 
Spacing the transpositions twice as far apart lowered the frequencies of 
the peaks by a factor of two. If there was indeed a reduction of attenu- 
ation between 1 and 10 me, it was covered up by the reflection effects. 
At this point, a thorough calculation of transmission loss was begun to 
help understand the phenomenon. Further experimental work was un- 
dertaken at the same time. Since no peaks appeared at frequencies below 
that corresponding to an interval of one-quarter wavelength, it seemed 
reasonable to expect the looked-for reduction of attenuation if the trans- 
position interval was shortened so that the frequency for which it was 
one-quarter wavelength was outside the desired frequency region. Using 
our previous experience as a guide, a spacing of about 9 feet was chosen. 
With this spacing, the first peak should have been at around 20 me. 





a 


Fig. 13 — Technique for making cable connections at a plug-in transposition 
point. 
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Fig. 14 — Attenuation of line with 75-foot transposition spacing (approx.) com- 
pared with that of reference two-conductor line and with computations. 


5.4 Final Measurement and Comparison with Theory 


The plug-in transposition in Fig. 13 was too elaborate to be used in 
such large numbers, and not too satisfactory. Therefore, a permanent 
transposition, shown in Fig. 15, was devised. A continuous cable having 
113 sound transpositions and 114 sections in a total length of 1057 feet 
was made ready for test. This time, after transmission measurements 
had been made and compared with those for the two-conductor reference 
line, the expected reduction of attenuation was found, as shown in Fig. 
16. The reduction at 4 me in the center of the region is 17 per cent. 

While the rough estimate of reduction made in Section II was 27 per 
cent, it should be pointed out that this depended on the assumption that 
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Fig. 15 — Cutaway illustration of permanent transposition. 


the diameter of the solid inner conductor be substantially the same as 
that of the tubular intermediate conductor. In the actual cable, the di- 
ameter of the solid core is 28 per cent less than that of the surrounding 
tube, mainly because of the 5-mil polyethylene insulator. Therefore, a 
reduction of only 15 or 20 per cent should be expected. The actual re- 
duction is right in the middle of this range. 
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Fig. 16 — Attenuation of line with 9-foot transposition spacing (approx.) com- 
pared with that of the reference two-conductor line. 
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Fig. 17 — Calculation of attenuation with transpositions at approx. 9 feet, 
approx. 43 feet and very small intervals; comparison with measurements where 
available. 


The calculation of transmission loss described in Section IV was fin- 
ished after the measurements. The results are plotted on Tig. 17 and 
show very good agreement with the measured data. 

The greater attenuation at very low frequencies is caused mainly by 
the transpositions and not by the fact that the central conductor cross 
section is 38 per cent less in the three-conductor cable than it is in the 
two-conductor one. The central conductor of the transposed line at de 
consists of two like structures in parallel. Each of these is a conductor of 
many series sections, with inner and intermediate sections alternating. 
Thus, the considerably higher-resistance intermediate sections dominate 
the situation. At very low frequencies, the attenuation ratio is approxi- 
mately 2, which is slightly greater than the value 1.6 shown at 100 ke. 


5.5 Transposition Interval 


In the calculation it was very easy to shorten the transposition inter- 
val. Two such cases are plotted in Fig. 17. When the spacing is just one- 
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half as long as before, the loss curve is nearly the same, but it crosses 
over the reference cable loss at about 10 mc instead of at 8 mc. Thus, 
the reflection peak at about 20 me (not shown on the figure) pulls up the 
loss curve a little at the high end. The other case calculated was for in- 
finitesimally small spacing of transpositions, and this gave substantially 
the same loss curve as the 43-foot spacing. We conclude that the reflec- 
tion peak at 40 mc has a negligible effect on the loss curve in the region 
of improvement. Also, it is seen that the 17 per cent improvement ob- 
served experimentally would be raised to 18 or 19 per cent by spacing 
the transpositions at about 4-foot intervals. | 

The attenuation was calculated also for the cable with transpositions 
75 ft 9 in. apart. The results, in good agreement with the measurements, 
are plotted on Fig. 14. While a random longer spacing of transpositions 
would greatly reduce the sharp peaks of loss in Fig. 14, the way to get 
the smoothest loss curve with the greatest reduction of loss is to make 
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Fig. 18 — Calculated attenuation of a transposed line similar to that used in 
experiments, but with 1-mil dielectric thickness between innermost conductors. 
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the spacing small enough so that the first peak is well beyond the desired 
operating region. 


5.6 Lffect of Spacing Between Inner and Intermediate Conductors 


Another result should be of interest here even though it was not ob- 
tained experimentally. A calculation for loss was made for a line in which 
the inner conductor was enlarged so that the polyethylene separating it 
from the intermediate conductor was only 1 mil thick instead of 5 mils. 
The results are plotted on Fig. 18. All curves are calculated so that they 
are on the same basis. At 4 mc, the transposed three-conductor with 
5-mil polyethylene shows an improvement of 21.2 per cent, and the 
transposed three-conductor with 1-mil polyethylene shows an improve- 
ment of 26.9 per cent. . 

The latter case approximates the original assumption of equal diame- 
ters much more closely than does the former, and shows an improvement 
very nearly the same as the 27 per cent prediction of Section II. It was 
realized in the beginning that the dielectric separating the inner and 
intermediate conductors should be as thin as possible, but 5 mils was a 
much more practical figure both for fabrication of the cable and for the 
transpositions. These results serve to point the way to further improve- 
ments. 


5.7 Conclusions 


The experiments have compared the transmission loss of two coaxial 
cables which are the same except that the center conductor of one is 
solid while that of the other consists of a shell surrounding and insulated 
from a center core. It has been shown that the transmission loss is less, 
over a certain frequency band, for the cable with the laminated center 
conductor, provided the laminations are transposed at sufficiently fre- 
quent intervals. Also, calculations of transmission loss show very good 
quantitative agreement with measured values, thus tieing the theory to 
the practice. Further, the results show that, if the insulating space be- 
tween intermediate and inner conductors can be reduced to a thickness 
negligible compared to its radius, the reduction of loss can be greater 
and will apply over a greater frequency band. 


VI. NUMERICAL CALCULATION 


6.1 Surface Impedances and Propagation Constants for Uniform Line 


The conductor surface impedances referred to in Sections II and III 
may be calculated from formulae given .in Schelkunoff’s paper.’ 
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When the calculations were started, no attenuation reduction had been 
found experimentally, even though measurements had been made. 
Hence, we wanted to make the calculations as exact as possible in order 
that no small effect in the uniform line properties, which might have a 
large effect in the transposed line, would be overlooked. Therefore, the 
exact expressions for the surface impedances [equations (75) in Ref. 2] 
were used as far as possible. Expressed in terms of modified Bessel func- 
tions of the first and second kind, J, and K,,, respectively, the results are: 








Z one Io(ob) 
Qnb, 1(0b,)’ 
ZO 7 Ty(oa2) Ki (ob2) + Ko(ode)L1(obe) 








27 az I\(cb2)K (a2) = I1(cd2)K1(abs)’ 





Z Cena I ola be) Ki (cae) + Kolo bo) I 1 (ca2) (36) 
ms 2rbe I,(ob2) K1(aaz) = I,(cd2) Ky (abe) 
Dae = : : 


os 2rgarby T1(ab2) Ky (ca) = T,(ca2)Ky(cb2)’ 


z © — 2 Toloas) Ki(obs) + Ko(oas) Ii (obs) 
= 273 I;(ob3)Ky(casz) = I,(caz)Ky(ob3) 


In these expressions, the b’s and a’s are the radii of the various conductor 
surfaces as shown on Fig. 1, and y and o are the intrinsic impedance and 
the propagation constant of the conductor material. They are 


(1 + 1) Wafug, 
i= /-1. 


The dimensions of the cables as supplied by Western Electric are: 


o 


b: = 0.0178 in. = 4.52 X 10“ meter, 
a2 = 0.0234 in. = 5.95 X 10° * meter, 
b. = 0.0246 in. = 6.25 & 10° meter, 
a3 = 0.0862 in. = 2.19 X 10°° meter, 
bs = 0.0902 in. = 2.29 X 10° meter. 


_ For the two-conductor reference line, az and 63 are the same as above and 


by’ = 0.0235 in. = 5.97 X 10° meter. 
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The cable material constants as supplied by the makers of the cable are ° 


w= po = 44 X 10 henry/meter 


peer 
= 2.26 = 2360 xX 10° farad/meter 


g = 5.858 X 10’ mho/meters 


v 


(at 68°F this is 101 per cent IACS). 

These impedances were calculated by means of the IBM 650 computer 
from 100 ke to several megacycles at points uniformly spaced on a loga- 
rithmic frequency scale. Each frequency point is higher than the one 
preceding it by a factor of 10°". In the main program, the libraries used 
for computing the modified Bessel functions at each point were prepared 
by Miss M. C. Gray. 

For sufficiently high frequencies, approximate formulas for the surface 
impedances are satisfactory. These are: 


@ 1 i 
lw = Qrb, E + oh 
(2) _ = a ah 
Lua = a | eoth a(bz — az) 8 
ee (: = mn coth? o(b, — a) | 
8c ae 
G+ 
So Qe be 
: (38) 
1 1 2 
— (2 i) coth’ a(be a) | 


E (i - ‘| coth o(b. — a) | 
Zor = n 8a be 
or VY Agbde sinh a(be = a2) 


1 3 
ona | eoth a(bs — a3) — a(t + 3) 


3 fil 1 2 
Ge € :) coth o(bs a) | 


The higher-order correction terms are not the same as the ones given 
by Schelkunoff’s paper” which, as pointed out by Miss Gray, are incor- 
rect. These also were computed using the IBM 650 machine. However, 
they went through much faster than the exact ones, and so all the fre- 


mn 
| 





Lie = a am | eoth a(be = 2) + — 


(3) 
Z aa = 
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quency points (25) from 100 ke to 25 mc were computed. As it turned 
out, the approximations were very close to the exact values of impedance 
except at the lowest few frequency points, and the greatest differences 
were in the reactances. 

The resistance and reactance of Zy°” (surface impedance of the inner 
conductor) are fairly close to one another, increasing about as ~/f and 
having values of 0.1 ohm near 1 me. The three resistances Rae”, Res” 
and Ra of the intermediate conductor are nearly the same and constant 
at about 0.15 ohm up to about 3 me, where the first two begin to increase 
and R.» begins to decrease. The corresponding reactances vary linearly 
with frequency over most of the range, the first two being positive and 
equal to the resistance at about 0.7 mc, and X,., being negative and equal 
to Ra at about 1.5 me. The resistance Raa” begins to increase from 0.015 
ohms around 0.4 me. All these values are per meter of length. 

In the third calculation using the IBM 650, the inductive reactances 
Xa and Xq. (5) of the space between conductors were computed and 
combined with the proper values of surface impedances at each of the 
25 frequency points (exact as far as they went and approximate beyond) 
to give the series impedances Z1, and Z2, (8) and (9). The dielectric 
admittances Y; and Y2 (6) and the transfer impedance Z,, were then com- 
bined with these to form the coefficients of the characteristic equation 
(22). The four roots (two distinct) of this equation are the propagation 
constants of the uniform three-conductor coaxial line. Voltage and cur- 
rent ratios were computed at each mode, using (20). These quantities 
are shown in Table I. 

The numbers tabulated in Table I contain eight significant digits, Just 
as they came from the IBM 650 computer. This precision would not be 
justified by the primary data if these were final results. But they are 
intermediate results which were used as input data to the computer for 
the transposed line calculation. 


6.2 Transposed Line 


From the characteristic equation (32) for the transposed line section 
of length 22 having one transposition at its center, the four roots »; 
were found and, from these, the four (two distinct) propagation con- 
stants yr; were calculated. This was done for the transposition interval 
21 = 73 ft 9 in. of the first experiment and the interval 21 = 9 ft 3} in. 
of the second experiment, and for an interval of 4 ft 7% in. not done ex- 
perimentally. Only the A; vary with transposition interval 2], since the 
Q’s depend only on current and voltage ratios of the natural travelling 
waves. These quantities are shown in Table IT. 


Frequency, mc 


TasLe I — Unirorm Tures-Conpucror Ling — y1 = a1 + jh1 





10'a1 


10°68; 





























nepers/meter radians/meter 
0. 10000000 3.4688831 0.38348735 
0. 15848935 4.0799877 0.52214634 
0.25118873 4,8395041 0.81834244 
0.39810738 5.7671483 12854328 
0.63095778 6.9660247 2.0228904 
1.0000008 8.5580372 3.1876156 
1.5848947 10.693456 5.0275308 
2.5118893 13 .462726 7.9349729 
3.9810768 16 .887865 12.532983 
63095828 21.1385279 19 .809733 
10.000016 26 .526402 31.329134 
15.848960 33 .384128 49 567992 
25.118912 42 .047592 78 .451609 
Soy 2 
Frequency, mc pr cee odaaa ects 
0. 10000000 4.7318826 0.62320343 
0.15848935 5.6942080 0.84536090 
0.25118873 6.7525699 1.1723877 
0.39810738 78427964 1.6701939 
0.63095778 8 .9337375 2.4480157 
1.0000008 10.044322 3.6761608 
1. 5848947 11.241957 5.6192649 
2.5118893 12.643512 8.6900187 
3.9810768 14.433451 13 .586061 
6 .3095828 16.928777 21.1744383 
10.000016 20.664341 33.195442 
15.848960 26.361051 52.069905 
25.118912 34.329361 81.679447 
Frequency, mc tio/in (t.p.) tefin (ip.) 
0. 10000000 0.18362010 0.16630834 
0.15848935 0. 22474330 0. 22944720 
0.251188738 0.27695130 0.30501473 
0.39810738 0.33751280 0.40492969 
0.63095778 0.40881000 0.54361980 
1.0000008 0.48911890 0.738916345 
1.5848947 0.56956110 1.0221795 
2.5118893 0.62801670 1.4417420 
3.9810768 0.60888470 2.0726476 
6.3095828 0.37242140 3 .0234415 
10.000016 —0.41964610 4,4235045 
15.848960 —2.5566098 6.2815904 
25.118912 —7.7731939 7.8207114 
Frequency, mc t32/ta1 (r.p.) t32/ta1 (i-p.) 
0.10000000 —1.1816245 0.025519684 
0.15848935 —1.1729224 0.032395813 
0.25118873 —1.1624073 0.038792862 
0.39810738 —1.1501389 0.045454252 
0.63095778 —1.1354390 0.052262004 
1.0000008 —1.1181120 0.058627941 
1.5848947 —1.0980730 0.063870189 
2.5118893 —1.0754678 0.066832765 
3.9810768 —1.0512315 0.065599039 
6.3095828 —1.0272895 0.060118625 
10.000016 —1.0063910 0.048717882 
15.848960 —0.99185120 0 .032878953 
25.118912 —0.98612750 0.016017054 
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TABLE I — Concluded 














Frequency, mc vie/vu (r.p.) v2/tu Ci.p.) 
0. 10000000 0.84589800 0.018268992 
0.15848935 0.85192153 0.023529840 
0.25118873 0.85932664 0.026878194 
0.39810738 0.86810431 0.034308091 
0.68095778 0.87885470 0.040451987 
1.0000008 0.89191254 0.046767278 
1.5848947 0.90761553 0.052792147 
2.5118893 0.92625098 0.057560082 
3.9810768 0.94753039 0.059488464 
6.3095828 0.97011293 0.056772712 
10.000016 0.99132599 0.047989710 
15.848960 1.0071079 0.033385241 
25.118912 1.0137993 0.016466337 
Frequency, mc v32/va1 (r.p.) 032/031 (i.p.) 
0.10000000 —2.9917880 2.7097173 
0.15848935 —2.1786147 2. 2242917 
0.25118873 —1.6316601 1.7969975 
0.39810738 —1.2145871 1.4571936 
0.63095778 —0.88363188 1.1750175 
1.0000008 —0.62260636 0.94088943 
1.5848947 —0.41596602 0.74652410 
2.5118893 —0.25394751 - 0.58298677 
3 .9810768 —0.13047707 0.44414391 
6 .3095828 —0.040132660 0.32580464 
10.000016 0.021255900 0. 22404223 
15.848960 0.055588468 0.13656873 
25.118912 0.063935269 0.064322522 
Frequency, mc vu/iu (r.p.), ohms vu/in (i.p.), ohms 
0.10000000 —77 .329377 —1.1101280 
0. 15848935 —78.639675 — 6 .2732360 
0.25118873 —80. 548062 —11.518517 
0.39810738 — 83 .030268 —17.666091 
0.63095778 —86.205314 — 25 .523024 
1.0000008 — 89 .947652 — 36.084481 
1.5848947 —93.776202 — 50.978284 
2.5118893 —96.541720 —72.748214 
3.9810768 —95.405167 — 105. 27411 
6 .3095828 —83.247510 — 154.12323 
10.000016 —42.817517 —225.93510 
15.848960 66. 135311 —321.23818 
25 .118912 332.08865 — 400.61598 


Frequency, mc 





vai/ta (r.p.), ohms 


v3i/ia1 (i.p.), ohms 


0. 10000000 —5.7971713 0.30779950 
0.15848935 — 5.9324594 —0.10551400 
0.25118873 —6.0322898 —0.46269910 
0.39810738 —6.1941820 —0.89163540 
0.63095778 —6.5156971 —1.3739212 
1.0000008 —7.0303966 — 1.8609257 
1.5848947 —7.7530091 —2.2944494 
2.5118893 —8.6694443 —2.5937969 
3.9810768 —9.7168424 —2.6696293 
6.3095828 —10.780327 —2.4534229 
10.000016 —11.699977 —1.9224920 
15.848960 —12.286170 —1.1462743 
25.118912 —12.411509 —0.33026606 
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TABLE II — TrANSPosED THREE-CONDUCTOR COAXIAL LINE 
Yri = ATi + Bri 











Transposition Interval 21 = 73 ft 9 in. 

















Frequency, mc py tae Pee 
0.1000 1.204 0.07565 1.674 
0.1585 1.257 0.1187 1.747 
0.2512 1.320 0.1871 1.834 
0.3981 1.399 0.2955 1.944 
0.6310 1.519 0.4679 2.111 
1.0000 1.776 0.7471 2.468 
1.585 4.018 1.209 5.584 
1.995 15.56 1.475 21.63 
2.512 9.072 1.415 12.61 
3.162 4.387 0.9181 6.097 
3.981 3.831 0.3169 5.325 
5.012 4.253 0.4368 - §.991 
6.310 14.52 1.367 20.17 
7.943 5.346 0.6606 7.429 
10.00 7.610 0.8457 10.58 

12.60 7.044 0.5103 9.789 
15.85 10.51 1.0378 14.61 


Transposition Interval 21 = 4 ft 73 in. 


3 : 

Frequency, mc wee Lae , Tear 
1.000 1.214 0.04610 2.404 
1.585 1.322 0.07292 2.618 
2.512 1.545 0.1153 3.059 
3.981 1.867 0.1825 3.698 
6.310 2.543 0.2890 5.035 
10.00 3.770 0.4579 7.467 
12.60 4.767 0.5767 9.440 

15.85 6.185 0.7273 12.25 











Transposition Interval 21 = 9 ft 3} in. 








Freueny al, | TSE 
0.1000 1.506 0.009458 1.491 
0.1585 1.574 0.01486 1.558 
0.2512 1.655 0.02337 1.638 
0.3981 1.756 0.03690 1.739 
0.6310 1.900 0.05830 1.882 
1.000 2.124 0.09221 2.103 
1.585 2.464 0.1459 2.440 
2.512 2.990 0.2308 2.961 
3.981 3.804 0.3655 3.767 
6.310 5.301 0.5798 5.249 

10.00 9.102 0.9258 9.012 
12.60 16.45 1.185 16.29 
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Frequency, mc 


0.1000 
0.1585 
0.2512 
0.3981 
0.6310 
1.000 
1.585 
2.512 
3.981 
6.310 
10.00 
12.60 


TaBLeE II — Concluded 








10°a 321 
nepers 


1.2450 
1.4960 
1.776 
2.072 
2.381 
2.715 
3.092 
3.546 
4.126 
4.932 
6.214 
7.533 





Br321 
radians 





0.01740 
0.02391 
0.03354 
0.04826 
0.07121 
0.1073 
0.1640 
0.2535 
0.3947 
0.6180 
0.9763 
1.248 





Yr = C71 I jer 


Transposition Interval — Infinitesimal 











5-mil Dielectric 





Frequency, mc 


0.1000 
0.1585 
0.2512 
0.3981 
0.6310 
1.000 
1.585 
2.512 
3.981 
6.310 
10.00 
15.85 
19.95 


10°eT: 


0.5329 
0.5569 
0.5856 
0.6215 
0.6725 
0.7504 
0.8716 
1.053 
1.326 
1.780 
2.586 
3.944 
4.842 





nepers/meter 


_ BT 
radian/meter 


0.003347 
0.005521 
0.008271 
0.01306 
0.02063 
0.03263 
0.05161 
0.08163 
0.1291 
0.2043 
0.3233 
0.5113 
0.6430 








Total Attenuation, 
db/1057 ft. 


1.419 
1.558 
1.638 
1.739 
1.882 
2.100 
2.439 
2.947 
3.711 
4.980 
7.234 
11.04 
13.55 





1-mil Dielectric 





Frequency, mc 


0.1000 
0.1585 
0.2512 
0.3981 
0.6310 
1.000 
1.585 
2.512 
3.981 
6.310 
10.00 
15.85 
19.95 


10:aT) 


nepers/meter 


_ bt 
radian/meter 





0.5109 
0.5330 
0.5565 
0.5867 
0.6308 
0.6999 
0.8098 
0.9770 
1.232 
1.663 
2.439 
3.762 
4.368 


0.003331 
0.005232 
0.008248 
0.01303 
0.02060 
0.03258 
0.05155 
0.08156 
0.1290 
0.2042 
0.3232 
0.5112 
0.6428 








Total Attenuation, 
1057 ft. 


1.429 
1.491 
1.557 
1.642 
1.765 
1.958 
2.266 
2.734 
3.447 
4.652 
6.825 
10.53 
12.98 
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TasLE III — Two-Conpucror Lins— y = a + 38 





3 otal Attenuation, 
Frequency, mc egos Tineter aalian ait = db/ 1057 f s 
0.1000 0.3350 0.003342 0.9373 
0.2000 0.4252 0.006549 1.190 
0.4000 0.5547 0.01292 1.552 
0.7000 0.7055 0.02242 1.974 
1.000 0.8344 0.03188 2.335 
2.000 1.186 0.06332 3.320 
4.000 1.687 0.1259 4.721 
7.000 2.227 0.2197 6.232 © 
10.00 2.659 0.3133 7.441 
14.00 3.145 0.4380 8.799 
20.00 3.756 0.6250 10.51 
25.00 © 4.198 0.7808 11.75 


“ In the case of infinitesimal spacing of transpositions, the propagation 
constant ym for the low-loss mode was computed from (35) for both the 
actual cable and a hypothetical one in which the intermediate conductor 
was separated from the inner conductor by only 1 mil instead of 5 mils. 
These quantities appear at the end of Table II. 

In order to have a calculated curve to compare these two results with, 
the propagation constant of the two-conductor reference cable was also 
computed by modifying the process used for the uniform line. These 
quantities are shown in Table IIT. 

The numbers delivered by the computer were rounded off to four sig- 
nificant digits in Tables II and III. In the table of results for the 9 ft 

+ in. interval, the values for infinitesimal spacing were used up to 1 
me because the eight-digit precision of the computer was not sufficient 
to calculate the roots of (82) at these frequencies. At 1 mc the two meth- 
ods give answers differing by about 0.1 per cent. 
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Synthesis of Series-Parallel Network 
Switching Functions 


By WARREN SEMON* 
(Manuscript received February 13, 1958) 


From the switching functions of n variables, those which correspond to 
networks are abstracted and called network functions. Properties of those 
network functions corresponding to series-parallel networks are studied 
and a method for synthesis 1s developed. 


I. INTRODUCTION 


It is known that one may establish a mathematical model in which 
Boolean polynomials explicitly represent two terminal single-impulse 
series-parallel contact networks. The conventions used in this paper are 
Boolean plus, symbolized by ©, to represent parallel connection; Boolean 
times, symbolized by - or by juxtaposition, to represent series connec- 
tion; zero, to represent an open circuit; and one, to represent a. closed 
circuit. The symbol ’ is used to represent Boolean negation, also called 
“inversion” or ‘‘complementation.”’ Whenever f represents an open 
circuit, f’ represents a closed circuit and conversely. This is a conductance 
analogue, the dual of that used by Shannon.’ If f(a, 2, ++: , @n) is a 
switching function it may be represented in tabular form’ asa “canonical- 
form matrix” whose rows consist of those configurations of the variables 
for which f = 1, as shown below for the function 


f = x (ae’ @ 23') = 2y'X9'x3 © X1'Xo' x3 O aya o79. 





* The Computation Laboratory of Harvard University, Cambridge, Mass. 
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Definition 1 


A “series-parallel”’ network is a series or parallel combination of two series- 
. . . 3 
parallel networks; a single element is a series-parallel network. 


Definition 2 


A “network” function is a switching function of n variables, having no 
vacuous variables, which can be realized by a network containing n switches. 

As an example: f(a, v2, %3, %4) = 21%" ® 23’, is a network function. 
A four-switch realization is shown in Fig. 1. 


Definition 3 


A “sertes-parallel network function” is a network function at least one of 
whose n-switch realizations is a series-parallel network. 

The network of Fig. 1 is series-parallel, hence the corresponding func- 
tion is a series-parallel network (SPN) function. 

In what follows we will be concerned with the slnacteration of 
SPN functions and the synthesis of the networks to which they cor- 
respond. It should be noted that, although series-parallel networks com- 
prise a small subclass of the class of all relay contact networks,’ there are 
other types of switching elements for which they represent the only 
electrically useful networks. 


II. GENERAL THEORY 


Theorem 1 


The inverse of any SPN function ts also an SPN function. 
Proof: A Boolean expression may be derived which corresponds explicitly 


Fig. 1 — Four-switch realization. 
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to the series-parallel realization of the function.* DeMorgan’s laiv, when 
applied to this expression, yields a Boolean expression which exactly 
corresponds to a series-parallel n-switch realization of the inverse func- 
tion. 
Theorem 1 may be extended to cover all network functions whose 
n-switch realizations are planar.’ 


Theorem 2 


A minimum generating set* of network functions of n variables may be put 
into a one-to-one correspondence with the distinct} networks of n elements. 
A proof of this theorem has been given by Ashenhurst.° Some implications 
should be noted explicitly: (a) the n-switch realization of a network func- 
tion is unique, and (b) only network functions correspond to networks. 


Theorem 8 
Suppose that {(x1 , X2, °** , Xn) isan SPN function. Thenf can be expressed 
asan SPN function of the variables a,, a2, +++, ax, where each a is itself 


an SPN function of the variables in one of k nonoverlapping subsets of ne 
variables x; , X2, °** , Xn. 

Proof: This is Sbuious from the definitions of SPN function and series- 
parallel network. 


Theorem 4 
Let f(x1, X2, +++ , Xn) be an SPN function which in canonical form has 
m terms. Let f be expressed as an SPN function F(a,, a2, ++, ax), 


k Sn, where each a; is ttself an SPN function of 8; variables with m, terms 
in its canonical form (as a function of its 8; variables only). In the canonical 
form of F(ai), replace each a; by mi, each ai’ by mi’ = 2°! — mi, each 
Boolean @® by algebraic + and each Boolean - by algebraic -; then the 
result 1s M. 

Proof: It is only necessary to prove the theorem for two variables, since 
Definition 1 and Theorem 3 may then be used to complete an inductive 
proof. Suppose f equals ayae.f Each term of the canonical form of f may 
be formed by selecting one of the m, terms of a; and one of the m2 terms 


* The group of variable transformations (permutations and inversions of the 
variables) divides any class of switching functions into N nonoverlapping subsets, 
called equivalence classes; a set of N functions, one from each equivalence class, 
is called a ‘‘minimum generating set.” 

{| Two networks are ‘‘distinct’’ if one is not derivable from the other by trans- 
formations consisting of the reversal of two-terminal subnetworks. 

t Such a function is said to be ‘‘essentially series.”’ 
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of a,. All such terms must appear, and no others can. Hence m equals 
MMe . 

Now suppose f equalsa; © ae .* Foreach of the m, terms of a; there are 
2°? configurations of the variables of a2 which appear in the canonical 
form of f; similarly, 2°! configurations of the variables of a; are associated 
with each of the m2 terms of a2. This gives 2°'m, + 2°?m, terms. Of 
these, mm are duplicates. Hence 


Ss Ss 
m = 2m. + 2°°m, — mm, 


= mm! + my’m2 + mm. 


Theorem 5 


Tf f(x1, +++ , Xn) ts any SPN function, and m 1s the number of terms in tts 
canonical form, then m ts odd. 
Proof: By induction on n. The theorem may be verified by inspection 
for the cases n = 1 andn = 2. SO 

Now assume the theorem true for all k < n. Consider an arbitrary 
SPN function of n variables. It can be expressed either as ajay or as 
ay ® ay, where a; and a2 are SPN functions of fewer than n variables. 
Again let m,; be the number of terms of a; and m; the number of terms of 
az. Then 


m= mM, + Me 
is odd, or 
So 8} 
m = 2°?*m + 2°'me — myme 


is odd, since m, and m, are odd by the inductive hypothesis. This proves 
the theorem. 


Corollary 


It follows immediately that any network function for which m is even has 
only a bridge circuit realization. 
As an example, consider thefunction f(a , x2 , 73 , 4 , 3), Whose canonical- 
form matrix is given in Fig. 2 (m = 16). It has the five-switch realization 
shown, but no series-parallel five-switch realization.t 

It is reasonable to ask whether an odd number of terms in the canon- 
ical form of a network function implies that its realization is series- 
parallel. This is not so, as may be seen from the following example. 
* Such a function is said to be “essentially parallel.” 


t Note that Theorem 2 asserts that there is no other five-switch realization of 
this function. 
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The function F'(21,%2,23,24,%s,%6,27), Whose canonical-form matrix and 
seven-switch realization are shown in Fig. 3, has m = 59 but the net- 
work is not series-parallel. 


Theorem 67° 


The function {(x; , x2, +++ , Xn) defined by f; = 1,*1 = 0,1,---,m—1 
and f; = 0 fori = mis an SPN function for m odd. 


ta 
Aa 


Xo X3 X4 X5 


x ey 
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== Oc OorF KB FSP OCC OCoCoooo 
COrcocoorFrF FeO OCOo 
CSeoeooocoereccorreoocorrdco 
RB OCOrFOOROrFOFROrFROHO 








Fig. 2 — Five-switch realization. 


Definition 4 


The SPN function defined by f, = 1,1 = 0,1, ---,m — land f, = 0 
for i 2 m, m odd, ts called a ‘“‘canonical-form network function.” 
All SPN functions of one, two and three variables are, to within a vari- 
able transformation, canonical-form network functions. The first SPN 
functions which are not canonical-form network functions appear in 
four variables. Such functions may be characterized by the following 
discussion. 
Consider the Boolean expression for an SPN function which explicitly 
represents’ the n-switch series-parallel network for the function. Then 
i. There may exist a variable x such that f = ag, with g an SPN 
function. 


* Here, f; is the value of the function for the configuration of the variables 
which would represent the integer 7 in the dyadic number system. 
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ii. There may exist a variable x such that f = x © g, with g an SPN 

function. 

iil. The function may be such that no 2 exists which satisfies either 

1 or il. 
In cases i and ii the function is said to be ‘‘reducible”’; in case i, “‘ir- 
reducible.” 

If a function is reducible, then the ‘‘residue’”’ function g may be ex- 
amined for reducibility. For any SPN function such reduction will ulti- 
mately lead to a residue function which is (a) a single variable, or (b) 
irreducible and a function of more than one variable. An SPN function 
whose ultimate residue function is a single variable is said to be ‘‘com- 
pletely reducible.” 


Theorem 7 


An SPN function is, to within a variable transformation, a canonical-form 
network function if and only wf it is completely reducible. 
Proof: By induction on n, the number of variables. For n = 1 and n = 2 
the theorem is obviously true. 

Now assume the theorem true for all k < n. Suppose f to be an SPN 
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Fig. 3 — Seven-switch realization. 
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function of n variables which is completely reducible. Then either f = 2’g 
or f = 2’ @ g, and g is a completely reducible SPN function of n — 1 
variables. The inductive hypothesis then guarantees that f is a canonical- 
form network function. 

Suppose now that f is a canonical-form network function of m terms. 


(a) If m < 2”, then f equals xn'g(a1, -+* , %n—1), Where g is a canon- 
ical-form network function. . 
(b) If m > 2", then f equals rp’ ® g(a, «++, %n-1), Where g is a 


canonical-form network function. 
In both cases, the inductive hypothesis guarantees that g is completely 
reducible, hence so is f. This proves the theorem. 

The irreducible SPN functions of n variables are a set of some in- 
terest. One question occurs immediately: How many such functions are 
there? This may be answered as follows. 

Let S, be the number of functions in a minimum generating set of 
SPN functions of n variables. Since, by Theorem 2, the correspondence 
between functions and networks is one-to-one, S, is the number of 
series-parallel networks of n elements.* Let 7, be the number of functions 
in a minimum generating set of irreducible SPN functions of n variables. 


Theorem 8 
In41 — Sati —= 28n . 


Proof: S, is the number of series-parallel networks of n elements, and 
7n is the number of series-parallel networks of n elements which have no 
single elements in series or in parallel with the entire network. Form 
networks of n + 1 elements by adding a single element in series to each 
of the S, networks of n elements; form the corresponding set by adding 
one element in parallel. There are 2S, such n + 1 element networks. 
None of these corresponds to an irreducible function. If the networks 
corresponding to all the irreducible functions are now adjoined, giving 
2S, + in41, this includes all the SPN functions of n + 1 variables, since 
any reducible function is included in the set of 2S, functions, and all 
irreducible functions are included in the set of 7.1; functions. Hence 
Snai = 28n + tayr. It is obvious that there are 2” completely, reduc- 
ible SPN functions of n variables since they are canonical-form network 
functions, and these correspond to the odd integers m, 1 S m S 2". 
The remaining SPN functions of n variables are each either irreducible 
or reducible to some irreducible,function. Theorem 9 exhibits this divi- 
sion. 


* These numbers are discussed in Ref. 3. 


884 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 
Theorem 9 
, n—2 
= : . —j-1 
Seen Pose iets Dad 
j=1 


(a) The term 2”~ represents the canonical-form network functions of 
n variables. 

(b) The term 7, represents the irreducible SPN functions of n variables. 

(c) The remaining terms 


n—2 

. n—j-1 
s 1412 
j=1 


represent the SPN functions which are reducible to irreducible functions 
of more than one variable. To prove Theorem 9, it is only necessary to 
solve® the difference equation, Sn4i — 2S, = tn41. Table I shows the 
numbers of interest for n = 1(1)10. 

Let f(v:) be a function of n variables, represented by a canonical-form 
matrix of zeros and ones (n columns, m rows), as in Section I. Let mio 
and mi = m — mio be equal to the number of zeros and ones respec- 
tively in the x; column of the matrix. 


Theorem 10 


If f(xi) is an SPN function, tt ts a function of x;’ or x; according as mj, 
or Mjo 28 the lesser. 








TABLE [ 
n—2 
n Sn tn m= D ijp-71 Cn 
j=1 
1 1 0 0 1 
2 2 0 0 2 
3 4 0 0 4 
4 10 2 0 8 
5 24 4 4 16 
6 66 18 16 32 
7 180 48 68 64 
8 §22 162 232 128 
9 1532 488 788 256 
10 4624 1560 2552 512 


S, = total number (taken from Ref. 3) of SPN functions in a minimum gener- 
ating set. 
Zt, = number of irreducible SPN functions in a minimum generating set. 
Yn, = number of reducible but not completely reducible SPN functions in a 
minimum generating set. 
C,, = number of completely reducible SPN functions in a minimum generating 
set. 
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Proof: By hypothesis, f can be written as f = 2j;*A @® B, where (a) x;* 
is either x; or x;’; (b) A and B are functions of the remaining variables 
only. Then 


f = xjrA ® x;*B ® x*’B 
xj*(A ® B) @ «7B. 


Since B S$ A @ B, the number of terms in B is equal to or less than the 
number of terms in A ® B. Now suppose that 2,;* be equal to x;, then 
f = 2;(A @ B) @ z,’B. But the number of terms in B is mj, and the 
number of terms in (A © B) expanded as a function of all n — 1 vari- 
ables is mj, , hence mjo S mj. But m = mjo + my is odd, hence mj < 
my. 

Similarly, if «;* = 2,’, mj is the number of terms in (A © B) and 
m,;, is the number of terms in B, so that mj, S mj. Again, the equality 
is impossible, so that mj, < mj. This completes the proof. 

As an example, consider the series-parallel network shown in Fig. 4. 
The corresponding function is 


f(x) = ay’ (x4 © X2'2s), 


and the canonical-form matrix for f is: 





Fig. 4 — Network for f(z) = 21'(%4 @® 22’%3). 
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The counts for each element are: 





By Theorem 10, fis a function of 21', x2’, x3 and a, and this is clearly 
true. . 

Consider any element x; in a series-parallel network. Let A; represent 
all of that part of the network which is in parallel with z;. Let Bi 
represent all of that part of the network which is in series with the 
parallel combination of A; and x;. Let g, be equal to By(A; @ z;). In 
general, let A; be that part of the network which is in parallel with 
gi1, and B; be that part of the network which is in series with the 
parallel combination of A; and g;1. Then an SPN function f may be 
expressed as: + 


N N N 
ap = vj IT Bi ® 2 Ae IT Bi,* 


where A; and B; are SPN functions of distinct variables (see Fig. 5); 
A, may be the trivial function 0, while By may be the trivial function 1. 


Definition 5 


x; 1s said to be a “series” element if and only if Ar = 0, and x; is said to 
be a “parallel” element if and: only if Ay # 0. 


Theorem 11 


In the circuit for an SPN function, x; ts a series (parallel) element if and 
only if the lesser of mjo and mj; 1s even (odd). 
Proof: Suppose that f is expressed as: 


N N N 
f= x; |] B; ® > A. I] B:, 
q=1 k=1 i=k 
with A; and B; SPN functions of nonoverlapping subsets of the vari- 


* Henceforth the symbol > will be used to signify extended application of the 
operation @. 
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ables. Then 


N 7 N N 
f(z; = 1,A:, B) = I[B: @ Dd Av IDB: 
rey] = rE 


isk ... 


N oN N N 
I] 8. @ ArT] B: © 2) As TT Bi. 


Clearly A; is a vacuous variable in this function, and wherever A,P 
appears in the canonical form A;’P appears also (P is a product involv- 
ing Az, -::, Aw, Bi, --: , By and their primes). Hence, if m4, is the 
number of terms in A; and A, is a function of S4, variables, then, by 
Theorem 4, ms, + ma,’ = 2°41 isa factorof mj. If A; 0, then Su, 4 0. 
hence m,; is even and mj odd. 

Now suppose A, is identically equal to 0, then © 


. N N 
fle; = 0, A;, B) = 2, Ar ITB: 


N N N 


Ai 1] Bi @ 2 Ae WDB: 


N 


N 
= > A. I B:, 


== 2 t=k 


or 


é Ss . 
and clearly B, is vacuous. Hence, mg, + mz,’ = 2°71 is a factor of mjo, 





xX; 
Aa 4 
By | A3 
7 
” ‘ : By 
2 5 | 


N° WN N 
Fig. 5 — Network forf = 2;].B:@ > A, ID B: 
i=1  k=1——Oi=k 
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and m,o is even, mj, odd. (Note that S4, = Sz, = 0 implies f = 0; 
this case need not be considered.) 


Theorem 12 


If two elements x; and x, are in series or in parallel, then mjo equals 
Mxo and mj, equals My, . 
Proof: If the function is expanded as 


N N N 
f= 21] B.@ 2 A TB, 
or as 
N N N 
f= (x; ® m) ITB: © 2) A TD B., 


and Theorem 4 is applied, the result follows immediately. 

It should be noted that the converse is not true; that is, mjo = mxo 
and mj, = mma does not necessarily imply that x; and x; are in series or 
in parallel. This will be seen in one of the examples which will be given. 


Theorem 13 


One of mjo and mj, 2s even; call this m. Let k be the largest integer such that 
2* | m.* Then k is the number of switches in the subnetwork which is in 
series or in parallel with x; . 

Proof: Suppose that f is a function of x; (not of z,;’/). This assumption is 
irrelevant in the argument, and is made only for the sake of definiteness. 
Then mj < mj, . 

Case 1: Suppose that x; is a parallel element. Then m jo is odd, by Theorem 
11, hence m = mj. Clearly, in the function f(z; = 1, A;, B:), Arisa 
vacuous variable. Therefore, in the canonical form of f(a; = 1, A;, B:) 
wherever A;’P appears so does AiP. Hence, by Theorem 4, 


Ma = (Ma, + ma,')C 
= 29410, 


But C is odd (if it were even, then removing the subfunction A, would 
make x; a series element, with mj odd, and this is impossible by Theorem 
11). Therefore 2°4: | m, and 2°417” \ m fork = 1,2, ---. 

Case 2: If x; is supposed a series element, then clearly ™ = mj. Also 
B, is vacuous in f(x; = 0, A;, B;). (Note that A, = 0 by hypoth- 


*a | b as usual means “a divides b”; a \b means “‘a does not divide b.” 
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esis.) A similar argument shows that 2°*:| m, while 26317" \ a for k 
= 1, 2, --- . This proves the theorem. 


Theorem 14 


Consider an SPN function f. Of f and f’, one ts essentially series, say f. 
Then f may be expressed asf = a+ B, where a, B are SPN functions of 
nonoverlapping subsets of the variables. If a equals a(x;), then mg | mjo 
and mg | mj, ; if B equals B(x;), then me | Mjo and mg | Mj; . 

Proof: If a equals a(x;) then mg | mj and mg | my, since every term in 
the canonical form of a (as a function of its subset of the variables) is 
repeated in the canonical form of f exactly mg times. 


III. SYNTHESIS 


Consider the function defined by the canonical-form matrix: 


8 
8 
to 


oO 





0 
0 
0 
0 
0 
1 
1 
1 
1 
1 
1 
1 
1 
1 
1 


Se | = FSF KF CO CO CO OC CO KF Fe KF Ee = 
SKS Be Be Be OR KF HF KF CO HF SF EF Se 
Soe ©. ae ee ese So. Se He eH So Lo Oe 
- Oo K§ Or KF CO Kf CO KF KF CO KF CS 


Suppose it is given that this function is an SPN function, and it is re- 
quired to find the network. 

Table IT shows the counts for each variable and the information which 
is derived from these counts by application of Theorems 10 through 14. 
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TaB_e IT 

: Prime Factorization of 

j mio min Variable. | -Senesot 
mio Mir 

1 5 10 heal P 1 5 2, 5 
2 5 10 Zo Pp 1 5 2, Be! 
3 3 12 13 P 2 3 3, 2? 
4 6 9 ae S 1 2,3 32 
5 6 9 5 S 1 2, 3 3? 


The column headed “variable’’ is filled in by applying Theorem 10. 
Since in every case mj < mj, only unprimed variables occur. The 
column headed “series or parallel’’ is filled in by applying Theorem 11. 
Since mo , M29 and mo are odd, 2 , X2 and 23 are parallel elements; 2, and 
Xs are series elements, since m4) and mso are even. The column headed 
“‘k”? is filled in for use in applying Theorem 13. Since 2 | 10 and 4 } 10, 
the values for a; and 2; are 1. Also 4| 12 and 8 } 12 givesk = 2 fora. 
Similarly for a; and 2x; the values are 1. The last two columns give the 
prime factors of mj and mj. Since my = ma, = 10 and each of x and 22 
is in parallel with a single switch and since no other m, equals 10, 2 
and 22 are in parallel, i.e., (vi © x2) represents a component of the net- 
work. Similarly (a - 25) represents a component. Since 2; is in parallel 
with a two-switch network but not in parallel with 2, (by Theorem 
12), it must be in parallel with x, - 25. Hence (v3 © wavs) represents a 
component. Since x; cannot be in parallel with 2, these two components 
must be in series, therefore f = (a; © 22) (% @ aX) and the net- 
work is as shown in Fig. 6. Note that if a equals (71 © 22) and 8 equals 
(x3 ® 2423), then mz = 3, mg = 5 and 


me | mo me| mu, Me | Map , me | Mex me | Meo ; 
Mea | Ms , Ma|Mo, Mel Ma, Ma| Ms, Mal Ms, 


as required by Theorem 14. If Theorems 10 through 14 are to be applied 
mechanically, the following fact should be noted. Once it has been deter- 


Xa Xyg—— Xs 


ty <3 


Fig. 6 — Network for f = (%1 @ 22) (%3 @® 245). 
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Then the original canonical form may be rewritten 
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mined that (a; © 22) and 2425 represent components, these combinations 
may be treated as single elements; the canonical form may then be 


modified and the count repeated. For example, let A equal (a; ® 2») 


and B equal x4%s . 
(in two steps) as 
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DOOR FAA AA AAA HHH OOCON AAA AAAs HOOCORne:M 





COCCCOnm Ft FOC OCnM MFO COCOCOCOnR nH OOOnANRoOoCoOCoCoO 
coocococoocoococOnnAArnOCooCoooCoOoCOCONMAAnAROoOCOCcoO 





Se so Pc oe ce ce oe OB eB Be oe ee ee ee ee ee ee Oe ae) 
Se Ocoee cn Bh ce Be ce ce Bs Bh ce en ee Be ee ee ee ee oe eo ee ae) 


CocococoooCcooCoOoOOC COCO RN A AAnin neat OOCOOCO 


Consider the canonical-form matrix: 


U1 %2 X%3X4 XH Lo X7 Lg Xg 


cooocoocoocoooooooococoecooococCooo oon AAAS 


and this function is obviously A(z3 @ B). 
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TABLE IIT 
Prime Factorization of 
j mjo m1 Variable | Sor P k 
myo m1 
1 30 75 21 Ss 1 2, 3, 5 3, 5B? 
2 75 30. | aa! S 1 3) 5 2, 3, 5 
3 30 75 x3 NS] 1 2, 3, 5 3, 5? 
4 30 75 x4 Ss 1 2, 3, 5 3, 5? 
5 63 42 x5’ Ss 1 37, 7 2, 3, 7 
6 63 42 xe! Ss 1 32, 7 2, 3, 7 
7 21 84 a7 P 2 ie: 237 
8 70 35 x3" P 1 2, 5, 7 5, 7 
9 35 70 2X9 P 1 5, 7 2, 5, 7 





This gives Table III. It is evident that 

i. one of 2122’, 1123 , 11% appears in the network; 

ii. 2s'v—’ appears in the network; 

iii. ve’ ® xy appears in the network. 
In order -to select the correct one of 21%’, %1%3, 214, Table IV is con- 
structed. Note that if zy is to be a component it must be possible to 
substitute A for xy in the canonical form. Then a count can be made 
of mao and m,4,. But A equals zero implies three terms as shown: 





A=1 A=0 
xy xy 
11 00 

01 
10 


Therefore, the number of occurrences of zy = 00, zy = Ol and zy = 10 
must all be equal. Clearly, in the example only the combination 222’ 
satisfied this requirement. This implies that a 2%’ and 2374 represent 

















TABLE IV 
00 o1 10 il 
1129 15 15 60 15 
W173 0 30 30 45 
L104 0 30 30 45 
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components. Reducing the canonical form in terms of the new variables 


yr = X22! 
Yo = Ut, 

Ys = U5'Xe 
Ys = L7 


Ys = 3) @ Ny 


gives the following matrix: 


nnd 
oe 
= 
nd 
x 





0 
0 
0 
1 
1 
1 
1 
1 
1 


KH & KH CO CO CO KF SF F 
-S KF CO KF KF CO KK KF CO 
—- CO KF FF CO KF KF CO Ff 
EE ee EE oe 





Analysis of the counts for this matrix is shown in Table V. A count for 
YY2, WY3s, yiys yields Table VI. Therefore z, = 2x12’ ® W304, 22 = 
X5'X6 @O Xz, and 2, = xg’ ® 4 represent components. Again reducing 
the canonical-form matrix leads to 





21 2g &3 


111 








Hence the function is 
f = (@1%2" © xsarq) (ar5/a6’ @ 2X7) (Hs? © 4) 


and the network is as shown in Fig. 7. 

It should be noted that an attempt to apply Theorems 10 through 14 
to a non-SPN function will lead to a contradiction. For example, the 
function whose canonical-form matrix and bridge network are shown in 
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TABLE V 
ee Prime Factorization of 
j mjo mir Variable | Sor P k 
Be mio ma 
1 3 6 v1 P 1 3 3, 2 
2 3 6 Ye P 1 3 3, 2 
3 3 6 Ys P a | 3 38, 2 
4 3 6 Ya Pp 1 3 3, 2 
5 0 9 Ys S a = 32 
TasBLe VI 
00 ol 10 11 
Wye 0 3 3 3 
Y1Y3 1 2 2 4. 
Yys 1 2 2 4 
C3-—— Ly X7 L9 
Xy—— XS Xe—— XG Xe 


Fig. 7 — Network for f =(ait2’ @ 23%4) (x5’%6' B 7) (te’ W 2p). 











TaBie VII 
j mio mir Variable SorP k 
1 38 21 xy! P 1 
2 43 16 Lo! s 4 
3 33 26 x3" 8 1 
4 34 25 x,’ P 1 
5 33 26 Ze" S 1 
6 38 21 Le P 1 
7 43 16 Ly’ SS) 4 











Fig. 3 yields Table VII. If this function is to be an SPN function, 2,’ 
must be in parallel with a one-element subnetwork. But no other element 
exists with the number pair (84,25) as (mj, mj), and this contradicts 
Theorem 12. 

The results produced thus far by this investigation suggest many 
intriguing possibilities. For example, if a function is not an SPN func- 
tion because it requires both x; and x,’ contacts, can rules be derived for 
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augmenting the canonical form of the given function so as to produce 
the canonical form of a new function which (a) is an SPN function of 
n + 1 variables and (b) reduces to the given function when the new 
variable is replaced by x;’/? Obviously this notion can be extended to 
cover all functions by introducing a large enough number of new vari- 
ables. That this is not a trivial question is shown by the following 
example: 
Let f(x1 , 22 , 3, V4) be defined by the canonical-form matrix 


(1) 





Since m = 6, it is clear that this function cannot be an SPN function. 
Let us assume however, that it can be realized using five switches in- 
cluding both an x2 and an 2’, and attempt to form a function g(a , 22 , 
x3, 24, @) such that (i) g isan SPN function, and (ii) g(a, 2, 23, %4, 
a = 2') equals f(a1 , v2, v3 , v4). We proceed as follows: 

1. Copy the canonical form matrix of f, adding a column headed a. 
This column is the inverse of the column headed 2: . 


(2) 





Condition (ii) requires that these terms appear in the function g. 
2. The canonical form matrix for f’ is similarly augmented, giving 
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2 
& 
w 

8 
ix) 

8 
Cy 





(3) 


See eet ee OOO 
KSPR OCccdOrRRrH 
KSrRe ORF OCk KO 
RPOrRF OrFROrFOF 
COCRrKEFErOCOOO!:Ss 





and condition (ii) requires that these terms appear in g’. 
3. List the remaining possible terms of g. (These will include all 
combinations for which a = 2.) 


8 
§ 
- 
8 


(4) 


eee eee OOOCOOCOCCOCO 
BH HODOOOHHHEH OOOO! SA 


0 
0 
0 
0 
f 
1 
1 
1 
0 
0 
0 
0 
i 
1 
1 
1 


BPH OORPHFHOORHOOHHOO 





Since these terms will vanish when a is set equal to x2’, any of them may 
appear in the function g. Assuming that g is to be an SPN function of 
primed variables only, it follows that 
(a) whenever, in the function g, the combinations 2,0a3%41 or a1237,0 
appear, so must 2,0x32,0, and 
(b) the combination x,1x3041 may appear in g only if 2,0r37,0, 7,0x3141 
and 2123x,0 all appear as well. 
4. By the application of these rules the terms in (4) may be divided 
into three sets: (a) those which must appear in g, (b) those which must 
appear in g’ and (c) those which these rules do not determine. For this 
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case, these sets are 


not determined 


/ 


appear in g 


appear in g 





5. Combining these with the known terms of g and g’ gives 


not determined 


, 


appear in g 


appear in g 
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The terms presently given as appearing in g form an SPN function 
g = 2! 20! ® x34 a! 

- and 

g(t1 , te, 13, La, @ = Ae!) = 2y'ap) OD Xors'ty! = F(X, Xe, Ls, 4). 


However, note that if all the terms listed as ‘‘not determined” are ad- 
joined to those which “appear in g,” a canonical form for the SPN 
function 


(1 Xz, 3, Xe, a) = (a © a’) (ae! © 213'x4') 
is produced, and 
(Xr, Xe, L3, Xe, = Le!) = (ay! © a2) (ae! © atx’) 


= f(w1 , Xz, U3, Xa). 


From this example, it isclear that the requirement that g beanSPN func- 
tion, with g(a, 2, %3, %1,@ = Xe’) = f(a, Xe, %3, Xs) does not specify 
g uniquely. 


IV. CONCLUSIONS 


Clearly the derivation of the circuit for an SPN function is a process 
which can be mechanized. If a method for augmenting the canonical 
form of a non-network function can be formulated, then a purely me- 
chanical process could be set up for deriving the minimum series-parallel 
networks for an arbitrary function. 
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Circular Waveguide Taper of 
Improved Design 


By HANS-GEORG UNGER 
(Manuscript received August 22, 1957) 


Transitions between round waveguides of different sizes for Ti trans- 
mission are required to be free of mode conversion. Conical tapers with a 
gradual change of cone angle transform cylindrical waves in the round wave- 
guide into spherical waves in the transition region. They thus cause very 
little power conversion to spurious modes. Optimal tapers and almost opti- 
mal tapers are found. An improved taper connecting $ in. to 2 in. 1.D. 
waveguide must be 3 ft long to keep the spurious mode level below —50 db 
for frequencies up to 75 kmc. A taper of constant cone angle would have to 
be 58 ft long to obtain the same spurious mode level. 


I, INTRODUCTION 


In long-distance waveguide transmission, multimode waveguides of 
large diameter must be used to exploit the low-loss properties of the 
circular electric wave. Multiplexing of a series of frequency bands into 
one pipe is, however, most easily done at certain smaller diameters. Like- 
wise, sharp intentional bends can be negotiated more easily at small 
diameters. Therefore, transitions between different diameters must be 
made quite frequently in a waveguide system. 

If these transitions are built in the form of a conical taper which 
matches the waveguide sizes of both sides, they tend to excite higher 
circular electric waves. Since no simple means are known to suppress 
higher circular electric waves without affecting the lowest circular elec- 
tric wave, mode conversion-reconversion distortion can only be avoided 
by keeping power in all the higher circular electric waves at an extremely 
low level. Transitions are therefore required which will introduce very 
little conversion to higher circular electric modes. Conical transitions 
with a constant cone angle, unless they are made very long, generally 
excite too high a level of these spurious modes. 

Wave propagation in the conical taper is most easily explained i in terms 
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of normal modes. The normal modes of the circular electric wave fam- 
ily — the only family of interest here — are, in the cylindrical guide, 
cylindrical circular waves with plane equiphase surfaces and, in the 
conical guide, spherical circular waves with spherical equiphase surfaces. 
At the junction of a cylindrical guide to a conical guide — such as occurs 
twice in a conical transition — a cylindrical wave from the cylindrical 
guide excites a series of spherical waves in the conical guide. For in- 
stance, an incident TE: wave will excite all the TE, waves, thus caus- 
ing a rather high spurious mode level. 

To avoid this mode conversion, a transition which transforms the 
cylindrical wave into the spherical wave must be introduced at the junc- 
tion. S. P. Morgan! has suggested and worked out the design of dielectric 
inserts placed near the junction which, acting as quasi-optical lenses, 
transform the cylindrical waves into the spherical waves. However, be- 
cause of the dispersive character of the lenses and of the waveguide, 
good broadband performance is difficult to achieve. 

Another way of making a transition from cylindrical waves to spheri- 
cal waves — at least approximately — is to taper the cone angle from 
zero at the cylindrical guide to the finite value of the conical guide. If 
this is done gradually enough, nearly all the power incident in the cylin- 
drical wave will be transformed into the spherical wave, with a very low 
spurious mode level. Although this is only an approximate solution to 
the problem, it does have good broadband characteristics. 


II, WAVE PROPAGATION IN A TAPERED WAVEGUIDE 


In a tapered circular waveguide such as the one shown in Fig. 1, the 
radius @ is related to the distance z along the taper by a function which 
is assumed only to be smooth and to have a zero first derivative at both 
ends. 




















Fig. 1 — Tapered waveguide transition. 
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The field excited in any cross section by an incident TE wave can 
be expressed as the sum of the To, waves of a cylindrical guide having 
the same radius as that of the cross section. With this representation, 
the tapered waveguide appears to be an infinite set of mutually coupled 
transmission lines, each line representing one of the cylindrical TEon 
waves. The wave propagation is described by an infinite set of first- 
order differential equations. If we assume the taper to be gradual enough, 
the power in all TEon terms with m greater than 1 will be small compared 
to the power in the TEq term. We may then consider only coupling 
between TE and one of the TEo, terms at a time. Furthermore, we 
need consider only the forward travelling waves, since the relative power 
coupled from the forward waves to the backward waves is quite small. 
Thus, the infinite system is reduced to the well-known coupled line 
equations? 


a = —jRiA1 + kA ? an = —jB2A2 ae kyAi , (1) 
Z dz 

in which Ai. are the complex amplitudes of the cylindrical TE, and 
TEom respectively, 81,2 are the phase constants of these cylindrical modes 
and ko, , ki2 are the coupling coefficients. The coupling coefficients are cal- 
culated in the Appendix by converting Maxwell’s equations into general- 
ized telegraphist’s equations. Both the phase constants and the cou- 
pling coefficients are functions of the distance along the taper. Power 
conservation requires 


| Keys | aa | eos 


(2) 


kykar = —k?, 


Coupled line equations with varying coupling coefficient and varying 
phase constant have been treated by Louisell.’ 

Tn order to solve (1), local normal modes must be introduced. Normal 
modes are waveforms in a uniform waveguide which propagate without 
change of field configuration or, in terms of the coupled line description, 
do not couple mutually. Analogously, local normal modes in a non- 
uniform waveguide are waveforms of a local cross section which would 
propagate without change of field configuration in a waveguide which is 
uniform with respect to that local cross section. For example, local nor- 
mal modes in the waveguide taper are the spherical waves of a conical 
waveguide with a cone angle corresponding to the local cone angle in 
the taper. They would propagate in the conical waveguide without mu- 
tual coupling. 
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For the system of equations (1) local normal modes are defined by 


A= (W, cos 5 & — Wesin £8) exp (=i [ B iz) : 
; (3) 
A, = -4/% (Ww, sin = =e + W2 cos $8) exp (-1 | pac), 
v21 
in which 
goes Bare goa oe (4) 
2 , Bi — Be Ag: 


Upon substituting (8) into equations (1), the local normal modes must 
satisfy 


dW, 1 dé W dW. 
2) 


_ apy, = 14% 
ae WW = 55 ae jTWe = 5 qa (5) 


where 


I) = $V AR + AP. 


Equations (5) are coupled only through the terms proportional to dé/dz. 
If € is constant, they reduce to uncoupled equations for W, and W2. 
Tor a gentle change in taper angle, with 


lat 


OT de <1 (6) 





approximate solutions of (5) which proceed essentially in powers of 
dté/dz can be written down: 


Wi(z) = ge) [ws +3 aie W2(0) |, er sete) dz ’ 


== : 3 pseke") . oa —2 jp(z"") ” , 
A W,(0) , ae é ‘ de” e dz” dz ; 
(7) 
W2(z) = eh | 20) _ ! W.0 | oe e Pie) gat 
2 o dz 
ae i W.(0) i ae ge 7P@) e a rine") dz” a | 
4 0 az 0 dz 


in which 


ses “T(e") de’. (8) 
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The initial conditions in the taper are Ai(0) = 1 and A.2(0) = 0. The 
taper begins with zero cone angle; hence, from (3), 


Wi(0) = 1 W.(0) = 0, 


in which W, corresponds to the TE, wave and is the wanted local nor- 
mal mode, while W. corresponds to one of the TEo, waves with n # 1 
and is an unwanted mode. At the end 2 of the taper, the unwanted mode 
amplitude is 


| We(a)| = = | f : dé —2 jp(z) He (9) 


de” 





At the taper end the cone angle is again zero, (€(z:) = 0). Therefore 
| W2(21) | equals | Ae(z:) |. Equation (9) integrated by parts: becomes 


[ Tee 7 dz | : (10) 
0 ‘ 





| Wala) | = 


The mode conversion in a smooth but otherwise arbitrary taper can be 
calculated with (9) or (10). 


III. DESIGN OF A TAPER 


A waveguide taper can always be built to have as low a mode conver- 
sion as is wanted in a certain frequency band merely by making it long 
enough. However, an optimally designed taper has the smallest possible 
length for a given difference in diameters at its two ends and for a spec- 
ified unwanted mode level in a given frequency band. 

Some analogies between this problem and the problem of a trans- 
mission line taper of optimum design are evident. The transmission 
line taper for matching impedances is nothing but a tapered waveguide 
in which only one mode is propagating. Power can only be converted 
into reflected waves, and it is this reflected power which is kept small in 
a properly designed transmission line taper. If more than one mode is 
propagating, power will be scattered not only into the reflected wave 
but also into the other propagating modes. In fact, the power scattered 
-into backward traveling waves is quite small compared to the power 
scattered into forward traveling waves, and only the latter need be 
considered in a multimode waveguide taper. Therefore, the mode con- 
version in the waveguide transition corresponds to the reflection in the 
transmission line taper. 

It has been shown that a transition between transmission lines of dif- 
ferent characteristic impedances is optimally made by a series of steps 
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spaced about a quarter wavelength apart.4 The magnitude of these steps 
is chosen to give an input reflection described in its frequency dependence 
by a Tschebycheff polynomial. Similarly, a conical waveguide transi- 
tion is expected to perform optimally when it is composed of a number 
of sections with different cone angles, as in Fig. 2. The lengths of these 
sections should be chosen so that the converted energy from adjacent 
joints of sections is 180 degrees out of phase, and the changes of angles 
from section to section should be chosen so that the over-all conversion 
pattern corresponds to a Tschebycheff polynomial. However, since there 
is more than one mode to which power is converted and since the phase 
constants change along the transition, the design of such a multisection 
transition will be very difficult, if not impossible. 

If the number of sections in the line transition is allowed to in- 
crease indefinitely for a fixed over-all length, a continuous transmission 
line taper is formed. The results of the multistep transition of optimum 
design have been extended to this case.’ The input reflection of the 
continuous taper of optimum design is described in its frequency de- 
pendence by a Tschebycheff polynomial of infinite degree. The taper 
curve itself is given essentially by the Fourier transform of this Tscheby- 
cheff polynomial. The frequency band of constant low reflection now 
extends from a certain lowest frequency to infinity. 

The transmission line taper of optimum design does not have a con- 
tinuous reflection distribution, but it has reflection impulses at both ends. 
Assuming a completely continuous reflection distribution, an almost 
optimum design has been found.® The raised cosine function is a reflec- 
tion distribution which, in practical cases, closely approximates in its 
frequency dependence the input reflection of the continuous taper of 
almost optimum design. 

If the integration in (9) for the mode conversion in the waveguide 











Fig. 2 — Multisection waveguide transition. 
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taper is extended over the parameter p, 


IWalou)| = | [ e@e** ae, (11) 





the relationship of the mode conversion in the multimode waveguide 
taper to the input reflection of the transmission line taper becomes 
evident. In (11) p corresponds to the longitudinal coordinate and &(p) 
to the reflection distribution of the transmission line taper. 

The reflection distribution for an almost optimally designed line taper 
can be substituted for &(p) to give the almost optimally designed wave- 
guide taper. Even the optimum design can be found when some care is 
exercised in using steps in the waveguide taper to simulate the impulse 
functions in the reflection distribution. 

Once £(p) is given, we can find the waveguide radius a(p) and the axial 
distance 2(p) from (4), (5) and (8). The coupling coefficient can be written 


da _ da 
k = e(a) = T c(a) i (12) 
Therefore, 
da _ tan £() 
oo) GV + tan? Fo)’ 


which can readily be integrated by separation of variables to give a(p). 
With a known, IT is also known as a function of p, and the axial distance 
can be calculated: 

’ gd p 

0 Tp)” 

Some restricting remarks must be made in concluding this outline of 
the taper design. . 

Only one of the coupled higher-order modes has been considered here. 
This is adequate as long as the cone angle changes only gradually. Since 
all other higher order modes are weaker in coupling and farther removed 
in phase constant, the conversion to these modes will always be much 
smaller than the conversion to the TE 2 wave. If, however, there are 
abrupt changes in cone angle or even if the diameter of the guide changes 
in steps, as in the waveguide taper of optimum design, the power con- 
version to higher modes must be checked. 

The design procedure outlined here works only as long as the coupled 
mode does not go through cutoff within the transition. Power converted 
to backward travelling waves has been neglected, since the coupling 


(14) 
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to these waves is smaller and they are much more removed in phase 
constant than are the forward travelling waves. In the vicinity of cutoff, 
this is no longer true. Here, the propagation is more suitably described 
by the original generalized telegraphist’s equations (35) and (36) of the 
Appendix. They are, BOMEY GE: ‘not easily solved in the cutoff region of a 
mode. 

Since the coupling coefficients in (85) and (86) do not change with 
frequency, and since the difference in ‘propagation constants of the 
coupled waves does not increase with decreasing frequency, the power 
conversion does not increase when the frequency is lowered to bring a 
coupled mode to cutoff within the taper. Consequently, the waveguide 
taper can be designed for a frequency high enough to keep the lowest 
coupled mode propagating throughout the taper. It will then work pro- 
perly at all lower frequencies. 


IV. THE RAISED COSINE TAPER 


As a characteristic example, we will design a taper whose conversion 
distribution follows the raised cosine function 


to) = © sin? & , (15) 
pl pi 
in which C is yet to be determined from the radii a; and az . According to 


(11) the over-all conversion will then be even by the Fourier transform 
of the raised cosine function 


| Wao _ [a — oY 
C 2ni( 2 ) (16) 


Equation (16) is plotted in Fig. 3. 
We assume the TE and TE modes to be far enough from cutoff so 


that 
N 2 
(*) <1 (17) 


for both modes throughout the taper. Furthermore, we assume the taper 
to be gentle enough so that 


eee eae (18) 


These assumptions will enable us to carry out the integrations analyti- 
cally. 
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Fig. 3 — Mode conversion in the raised cosine taper. 


Integrating (13) with 


= Qhike - 
k-? = ki2 a 
[from (42) and (12)] and &(p) from (15) we get 
nf =in@(2— Jd sin are), (19) 
ay ai\pr 20 pi 


where C has been eliminated by the boundary conditions a(0) = a and 
a(pi) = az: | 

Qhike In a2 
ke? = k? a2 : 


C= (20) 


Because of (17) and (18) 


= kee” = ky 


DS oa (21) 
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The axial distance is found from (14) by substituting (19) for a@ in (21): 


eae | afp 1. 2p 
2 Bp |, exp 217 a 5 sins dp, (22) 


where Bo is the phase constant of free space. With a = 1/z In a:/a; and 
2 = 271(p/p,), the integrand can be expanded in a series 


2 
pe ee "(1 asing +$ sint oF ++) de (23) 


wk. — ky 


and, for moderate values of a, the axial distance z can be calculated by 
term-by-term integration. The total length of the taper is approximately 


28) ax. — ar oe in? = 
2 py OO ey 
Ins ge Int de? FI? > 
ay ay ay 


A numerical example will show the advantage of a properly tapered 
waveguide transition. Suppose a transition designed for the TE. wave 
and connecting { in. 1.D. pipe to 2 in. 1.D. pipe is required to have a spuri- 
ous mode level of less than —50 db for all frequencies up to 75 kmce. 
The most seriously coupled wave is TEx. For a raised cosine taper we 
get, from (20), C equal to 1.28 and consequently, from Fig. 3, p: equal 
to 14.8. Hence, from (24), the required taper length is z; equal to 3 ft. 
The taper curve is plotted in lig. 4. This curve has been calculated from 
(19) and (23), using p as a parameter. A taper of constant cone angle 
satisfying the same requirements would have to be 58 ft. long. 
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Fig. 4 — Raised cosine taper. 
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Circular waveguide tapers have been built according to Tig. 4. Their 
TEoi-TE2 conversion loss was measured at frequencies near 55 kme and 
found to be higher than 50 db. 


V. SUMMARY 


The power conversion from TE: to higher circular electric waves oc- 
curring in conical transitions of round waveguide can be reduced by 
changing the cone angle gradually rather than abruptly. Instead of a 
taper with constant cone angle, a transition with changing cone angle, 
which transforms the cylindrical waves of the round waveguide into 
spherical waves in the transition region, is suggested. 

Upon converting Maxwell’s equations into generalized telegraphist’s 
equations, the transition is represented as a set of nonuniform transmis- 
sion lines, nonuniformly coupled. With proper choice of the coupling 
distribution — and hence the cone angle — as a function of the distance 
along the transition, we can find an optimum design which minimizes 
the length of the transition for a specified frequency range and spurious 
mode level. In a transition of optimum design the mode conversion is 
given by a Tschebycheff polynomial of infinite degree in its frequency 
dependence, and the geometry of the transition isfound from the Fourier 
transform of this Tschebycheff polynomial. A simpler design, but still a 
good one, has a transition geometry given essentially by a raised cosine 
function. 


APPENDIX 


Generalized Telegraphist’s Equations of the Circular Electric Waveguide 
Taper 


A very convenient mathematical formulation of the electromagnetic 
problem in the waveguide transition is provided by 8. A. Schelkunoff’s 
generalized telegraphist’s equations for waveguides.’ Maxwell’s equa- 
tions for transverse electric and circular symmetric waves are, in cylin- 
drical coordinates: 








ae 
“Oz aa jouH, , (25) . 
OH, ee oH, 

as joelig + ae? (26) 


0 . 
ap (rE 6) = —juwprH,, (27) 
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where I, , H, and H, are the only nonvanishing components of the field, 
e is the dielectric permittivity, » the magnetic permeability and w the 
angular pulsation. The exponential dependence of time is understood. 

The boundary conditions of the waveguide taper are, at 7 equal to a: 


E,=0 (28) 
da 
H, = pe: (29) 


The field at any cross section of the paper is represented as a super- 
position of the fields of the normal modes in a cylindrical guide of the 
same cross section: 








| 
Tee 
Rey (3 5 (30) 
a WS ead (Km)? 


r 
2 ee (5) (31) 
m i VJ ad (Km) ; 


km | 
x = mae ( ‘)| aS 


where Jo and J; are Bessel functions of the Fist. kind and k,», is the mth 
zero of J1. The Vm, Im and tm have the dimensions of voltages and cur- 
rents. The factors of V, and J, are normalized so that Pm = 1/2(VimIm*) 
is the complex power flow in each normal mode. It has to be kept in 
mind that a is a function of z. 

_ The boundary condition (28) is satisfied by the individual terms of the 
‘series for ,. Hence, this series converges uniformly. Not so the series 
for H,: (81) is a representation for H, only in the open interval 
0 Sr <a, since the individual terms vanish at r = a but, according to 
(29), H, does not. In the closed interval, the series (31) represents a dis- 
continuous function and therefore does not converge uniformly. Term- 
by-term differentiation will make the series diverge. 

The relationship between 7,, and V» is found by substituting in (27) 
the series (30) for HE, and the series (82) for H, , and comparing coef- 
ficients 


Rs 
| 


H, 


ae ae (33) 
wpa 
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Using this relation and substituting the series (30) for Z, in (25) at r = a, 
we find that the boundary condition (29) is satisfied. 

To convert Maxwell’s equations into generalized telegraphist’s equa- 
tions, we introduce (30) and (81) into (25) and (26), multiply both sides 
of both equations by J,[k,(r/a)] and integrate over the cross section. 
Since the series for H, does not converge uniformly, we write for the left- 
hand side of (26) 


rd (i, r) ony ala (i "| que Ee (i, | i, (34) 
a} Oz a 0z a 


and invert integration and differentiation in the second term of this 
expression. 
The generalized telegraphist’s equations have the following form: 








avn a a 2hnkm 
dIn _ Buy ae da Qknkm 
dz ree piliy the, (36) 


The summations are extended over all m except m = n. The quantity 
8, is the phase constant of the nth mode in a cylindrical guide of the 
particular cross section; 8, is a function of a and therefore of z. 

The generalized telegraphist’s equations represent an infinite set of 
coupled nonuniform transmission lines. For our purpose, it is convenient 
to write the transmission-line equations not in terms of currents and 
voltages, but in terms of the amplitudes of forward and backward travel- 
ling waves. Thus, let A and B be the amplitudes of the forward and back- 
ward waves of a typical mode at a certain cross section. The mode cur- 
rent and voltages are related to the wave amplitudes by 


V=vVK (A+B), (87) 


ety) =. 
ii ve B), (38) 


where K is the wave impedance 


Kn (39) 

oe 
If the currents and voltages in the generalized telegraphist’s equations 
(35) and (36) are represented in terms of the traveling-wave amplitudes, 
after some obvious additions and subtractions, the following equations 
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for coupled traveling waves are obtained: 











dAn ee — ] K, + —_ 
BASS K Br + i (kam? Am + Iinm Bm) (40) 
OB eos: i Ke eos, ie 

Te 7 FBnBn —5 Fe An + pS (hee Bu — Kam Am). (41) 


The k’s are coupling coefficients defined by: 


Knkem K,,\ 1 da 
Km ” Kat — Bee (4/e + 4/Ke)t a dz (42) 


and 


dK, 


Ooo 
kn = °o 





For a cylindrical guide, da/dz is equal to zero and (40) and (41) reduce 
to uncoupled transmission-line equations. 
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The Nonuniform Transmission Line 
as a Broadband Termination 


By IRA JACOBS 
(Manuscript received December 17, 1957) 


The problem of obtaining a broadband microwave termination ts con- 
sidered from the point of view of nonuniform transmission line theory. At- 
tention is restricted to lines in which only the distributed shunt admittance 
may be varied. An optimization argument is presented which leads to the 
consideration of a line in which the fractional increase in admittance per 
wavelength in the line is constant. The nonuniform transmission line equa- 
tions are solved exactly for this case, and the results are expressed in terms 
of readily interpretable elementary functions. It 1s shown that a fixed geo- 
metrical length of line can lead to an arbitrarily large effective length without 
destroying the match at the input. The introduction of a small loss term 
makes the line almost totally absorbing regardless of its termination. 

The line has a long-wavelength cutoff given by 4x times the actual length 
of the line. If the line is short-circusted at its far end, a return loss of greater 
than 11.4 db is obtained at all frequencies above twice the cutoff frequency. 
The effect of certain practical limitations on the performance of this line is 
also discussed. 


I. INTRODUCTION 


Classical transmission line analysis leads to the propagation of a wave 
in which neither the electric nor magnetic fields have components in the 
direction of propagation.! These transverse electromagnetic (TEM) 
waves are characteristic not only of the usual transmission line structures 
such as parallel wires and coaxial cable; they are also characteristic of 
plane-wave propagation in isotropic media. 

It is often convenient, when dealing with TEM-wave propagation, to 
make use of results of classical transmission line analysis. Some care 
must be exercised, however, in applying these results at microwave fre- 
quencies. Consider, for example, the problem of terminating a lossless 
line. Classical analysis tells us that, if the line is terminated in its charac- 
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teristic impedance (which is a pure resistance for a lossless line), the 
termination will be reflectionless. Two difficulties arise at microwave 
frequencies, where the physical dimensions are no longer small com- 
pared to the wavelength. First, as the frequency increases, the concept 
of a lumped circuit element becomes less meaningful. For example, a 
resistive disc termination for a coaxial line will have an effective im- 
pedance which is strongly influenced by the geometry and has very little 
correlation with the de resistance of the disc.2 The second difficulty is 
that, even if the appropriate effective lumped impedance is obtained, 
the analysis assumes that this impedance is connected across an open 
circuit. An open-ended line at high frequencies is by no means an elec- 
trical open circuit. There is a true open circuit, however, one-quarter 
wavelength in front of a short circuit. Therefore, if the line is short- 
circuited by a metallic surface, and the appropriate characteristic im- 
pedance is placed one-quarter wavelength in front of the short, the 
termination will be reflectionless. However, if the frequency is changed 
the quarter-wave condition is destroyed, so that the termination is not 
broadband. 

The purpose of this paper is to consider analytically the use of a non- 
uniform transmission line as a broadband termination. If this structure 
is to be finite, it too must be terminated. To make the results independ- 
ent of what is beyond the nonuniform line and still maintain physical 
realizability, it will be assumed that the nonuniform line is terminated 
by a short circuit. The problem, then, is to match from a given charac- 
teristic impedance, Zo, to a short circuit by means of an appropriate 
nonuniform transmission line. 

There is extensive literature’ on the use of nonuniform transmission 
lines for impedance matching. Optimum matching procedures have been 
discussed‘ for matching two uniform lossless lines to one another by 
means of a lossless nonuniform line. In addition to the assumption of no 
loss, it is assumed that the magnitude of the reflection coefficient is 
much less than unity at all points along the line. Both of these assump- 
tions, however, are not applicable for matching to a short circuit. Since 
the object here is to absorb all incident energy, the line cannot be loss- 
less. Also, the magnitude of the reflection coefficient is unity at the short 
circuit. Thus, the matched-termination problem must be considered 
apart from the usual matching problem. 

In order to define the problem more precisely, it is necessary to con- 
sider the transmission line equations which determine the voltage V 
and current J along the line: 
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Lae ee 1) 
dx 
eV: (2) 
dx 


where Z (a) is the distributed series impedance per unit length and Y (2) 
is the distributed shunt admittance per unit length. The line is said to 
be uniform if both these quantities are constant. 

A uniform line may be used to match from an impedance of Z»y to a 
short circuit if Z and Y are both large complex constants (so that the 
wavelength in the line is small and the losses are large) and if, in addi- 
tion, 1/Z/Y equals Z,. Thus, in principle, a fixed length of uniform 
line can be made electrically long and extremely lossy, and yet it may 
have the same characteristic impedance Zp as does the line it is desired 
to terminate. In practice, however, this may be difficult to accomplish. 

Consider, for example, the problem of finding a microwave-absorbing 
material for anechoic chambers. Using the above principle it would be 
necessary to find a material with large complex relative permeability 
and relative dielectric constant, but such that the ratio of these two 
quantities was unity. It is not too difficult to obtain high dielectric con- 
stants at microwave frequencies,® but it is difficult in general to obtain 
equally large permeabilities. It is of interest, then, to consider a medium 
with a permeability equal to that of free space, and to attempt to match 
this medium to a short circuit by increasing the dielectric constant as 
the termination is approached. In the analogous transmission line prob- 
lem, the distributed series impedance, Z(x), is a fixed constant, and only 
the distributed shunt admittance, Y(x), is at our disposal. 

The following problem is thus suggested. A uniform lossless line, 
characterized by a distributed series impedance per unit length of jwL, 
and a distributed shunt admittance per unit length of 7wC1, is to be 
terminated by a nonuniform line of length s (see Fig. 1). The nonuniform 
line has a constant distributed series impedance per unit length of jwL, , 
but the distributed shunt admittance per unit length, Y(a), is as yet 
unspecified. This line is, in turn, terminated in a short circuit at x = s 
it is desired to find that function Y(x#) which minimizes the reflection 
coefficient at « = 0. 

The general variational problem is outside the scope of the present 
work. In the following sections a somewhat intuitive argument will be 
presented which leads to a particular form for Y(x). An exact solution 
of the transmission line equations will be obtained for this particular 
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nonuniform line, and it will be shown that the line indeed has the prop- 
erties indicated by the simpler intuitive arguments. 


II. DETERMINATION OF THE ADMITTANCE VARIATION 


Approximation techniques’ indicate that, if the fractional change in 
the properties of the line per “local wavelength over 27’’* is small, then 
reflections may be assumed to be negligible. The exact solutions of par- 
ticular nonuniform transmission lines, such as the exponential line,® in- 
dicate that reflections become important when this condition is violated. 

Since we are attempting to match from an admittance level jwC, to 
an infinite admittance, it is natural to ask the following question: How 
large can the admittance be made at x = s, subject to the conditions 





L,dx L,dx 
=o am ans ow oe eam ome om ene oe ome ne ome —_ = os om 
C,dx C(x)dx G(x)dzx 
ere See — oo oe 
Geese educa! 






LOSSLESS LINE, GENERATOR AT -00 TERMINATING SECTION 


Oo Ss 
Fig. 1 — Nonuniform transmission line termination. 


that it be equal to jwC; at x = 0 and that the fractional change in ad- 
mittance per local wavelength in the line be small? 
The wavelength in the uniform lossless line characterized by LZ and 
Cy is 
eee ee 
@ VLC 


The local wavelength in the nonuniform line is given by \/+~/e(x), where 


Y(x) _ C@) _ jG@) 
joC C1 wCy . 


in which C(a) is the distributed shunt capacitance per unit length and 
G(x) is the distributed shunt conductance per unit length. For the pur- 


(3) 


(4) 


* “Local wavelength” is defined as the wavelength in a uniform line which has 
the same distributed constants as the line in question at the point in question. 
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pose of the present section, it will be assumed that the nonuniform line 
is lossless, so that G(x) is zero and e(x) is real. The condition that the 
fractional change in admittance per local wavelength over 27 be small 
can then be written 


A 1 de (5) 


Dn a de = 
where a is an as yet unspecified small constant. 

The problem to be considered is thus: Given s greater than zero, find 
that function e(z) which maximizes ¢(s), subject to the conditions that 
«(0) equals 1 and that the inequality (5) is satisfied over the interval 
[0, s]. The solution to this problem is obtained by replacing the in- 
equality in (5) by an equality, which leads to the result 


eo) = (1-2)? 6) 


Thus, as x approaches \/za, e(x) becomes infinite. It would then appear 
that, in a fixed length of line, an arbitrarily large change in admittance 
can be utilized, and consequently a large effective length obtained, 
without violating the slowly varying condition. The introduction of a 
small imaginary component (shunt conductance) to e(a) should then 
make the line totally absorbing regardless of termination. 

In order to verify the above conjectures analytically it is necessary 
to solve the transmission line equations. This will be done in the follow- 
ing section. 


IH. SOLUTION OF THE TRANSMISSION LINE EQUATIONS 


For a line with uniformly distributed series impedance per unit length 
jwL, and distributed shunt admittance per unit length jwCie(x), the 
transmission line eae may be rewritten in the form 


ae r+(2 = eax)V = 0, (7) 
oa teas + (#) or = 8) 


where \ is given by (8). 

If e(x) is of the form (A + Bz)”, (7) may be transformed® into Bessel’s 
equation of order 1/(n + 2). The fact that considerable simplifications 
result when n equals —2 has been noted previously,” but the physical 
implications do not appear to have been discussed. 
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The expression for e(x), given by (6), is real. Since we are interested 
in absorption, the shunt admittance must contain a conductance term 
in addition to the capacitance, and consequently e must be complex. In 
order to maintain the same functional dependance for e, it will be as- 
sumed that the ratio of shunt conductance to shunt capacitance is con- 
stant, 


so that e(2) is given by 
1-4 . 
ex) = =) (10) 
1- — c) 
d 
If the change of variables : 
r= —In (1 = = 2) (11) 
is made, (7) and (8) can be rewritten 
dV dv ,4 , 
dI di ,4 : 
ae Ge + a — Jo/w)d = 0, (18) 


where the expression for € as given by (10) has been used. The solutions 
of these equations are 


V=e"™ (Vie + Vee" (14) 


and | 
r= j$VOjie"|- (1 +5) re" + (7-5) v0" |, 8 
where 


_ 1 4 . 
v= 4/4- 51 - fo/e) (16) 
may be interpreted as an effective propagation constant and r may be 
interpreted as an effective length. 

The lossless line (¢ = 0) is cut off when the effective propagation con- 
stant becomes real; that is, when a = 4. If the shunt capacitance, C(x), 


is to be frequency-independent, it follows from the form of e(x) that a 
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must be proportional to wavelength. A cutoff wavelength Xo and cutoff 
frequency wo may then be defined such that 


a= — = —, (17) 


Thus, y can be written as 


_ fil a . @ 
v= (B)4/1- 44 ie (18) 


It is interesting to note that these results are identical in form to the 
results that would be obtained for an exponential line,’ in which the ad- 
mittance and impedance vary as e”* and e— respectively, and where r 
is defined as ax. However, in order to obtain the same effective length, 
R, the actual length of the exponential line must be longer by a factor 
R/(1 — e”), which increases linearly with R for large effective lengths. 
Thus, the equivalent exponential line will, in general, be considerably 
longer than the line considered here. 





IV. REFLECTION COEFFICIENT 


If the boundary condition, V = 0 at x = s, is substituted into (14), 
the following relation is obtained for the input admittance: 


NSO) 4 4/5 ) 


where F is the value of r corresponding to « = s; that is, 


R= —In (1 e * 5). (20) 
do 
The complex voltage reflection coefficient, p, is given by 
= VG/lh — Ys (21) 
V/C1/ Ty + Y; 


Substitution of (19) into (21) gives 


1+ 7 + 237" coth yR 

p= —_-________. (22) 

1— j= 2iy — coth yR 
(23) @ 


For the reflection coefficient to be small over a broad frequency band, 
it is necessary that the transmission loss be large; that is, the real part 
of yR must be large. For any nonzero loss, ¢ > 0, one can, in principle, 
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Y(x)=jwC(x) + G(x) 











Clo) = C 
G(x)/C(x) =o" 





















































T/W 
Fig. 2— Admittance ratio required for 13-db transmission loss. 


make the effective length of the line, 2, sufficiently large so that the 
transmission loss is as large as desired. It follows from (20) that this 
can be accomplished in a line whose length is 


Be Ga 7), (23) 
Ar 
Thus, if the physical length and the desired effective length of the line 


are specified, the cutoff wavelength can be determined from (23). For 
large R, the cutoff wavelength is essentially given by 4s. 
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A large value of 2 is obtained by approaching the singularity in ¢(2). 
The real part of e(a), at « = s, is given by 


Gea (1-H) a ee 





so that exceedingly large values of distributed shunt capacitance are re- 
quired to obtain moderately large effective lengths. 

The above formalism may be used to calculate the distributed shunt 
capacitance at x = s required to ensure a given transmission loss. This: 
result will, of course, depend on the value of the loss parameter, c/w» . 
As an example, in Fig. 2 the value of C(s)/C; required to ensure a mini- 
mum transmission loss of 13 db for all w = wy is shown as a function 
of ¢/wo .* If o/wo is very much less than one, an astronomically large 
value of C(s)/C1 is required. Although this may not be realizable in prac- 
tice, it gives an upper limit to the ideal behavior of the line. 

If the transmission loss is sufficiently large, 


coth yR = 1. (25) 


If, in addition, ¢/w9 is neglected in comparison to unity, it follows from 
(18) and (22) that 


2 ies 1—- V1 — o?/ a 26 


Equation (26) gives the intensity reflection coefficient of the ideal line. 
The return loss (— 10 logio | p |?) is plotted as a function of frequency 
in Fig. 3. The return loss is zero at the cutoff frequency, but increases 
rapidly as the frequency is increased. t 

As a more practical example, the return loss will also be calculated for 
a/wo = 2. It follows from Fig. 2 that this requires C(s)/C to be 400 to 
ensure a 13-db transmission loss. This choice, in addition to being physi- 
cally reasonable, leads to some computational simplifications. It follows 
from (18) that, if ¢/wo is 2, then 


=3(1+52), (27) 


* The 13-db transmission loss requirement is equivalent to | e?"* | = 0.05. It 
can be shown from (18) that the real part of y has aminimum, a» , given by the 
smaller of Vo/8wo and o/4wo . These two results, together with (24), were used 
to obtain Fig. 2. 

+ In Section II the parameter a was introduced, and it was assumed that re- 
flections would be small if a were small. It has since been shown [in (17)] that 
a = 4/(w/wo). Fig. 3 then gives a quantitative demonstration of the initial sup- 
' position. For a = 4, there is total reflection. As a decreases (w/wo increases), the 
reflection is seen to decrease rapidly. 
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RETURN LOSS IN DECIBELS 









































Fig. 3 — Return loss of the ‘‘ideal line’’. 


so that the effective attenuation constant (real part of y) is frequency- 
independent. The effective length of this line is R = 3. For these values 
of y and R, coth yR varies between 0.9 and 1.1 as the frequency is 
changed. If, as before, it is assumed that coth yR = 1 (thus assuming 
R > 1, which is equivalent to neglecting interference effects due to 
multiple reflections), the reflection coefficient of the line is given by 


2 = eae ee 
1 + (w/wo)*” 


If this approximation is not made, the exact expression for | p |? is more 
complicated but still easily amenable to numerical evaluation. The solid 
curve in Fig. 4 gives the return loss as a function of frequency, as eval- 
uated from the exact expression. The dashed curve gives the return loss, 
neglecting interference effects, as determined from (28). In the frequency 
range depicted, the exact return loss is seen to oscillate about the value 
obtained when interference effects are neglected. However, in the high- 
frequency limit (w >> wo) the two curves diverge. The return loss in- 
creases without limit if the effective length of the line is infinite. How- 
ever, for R = 3, the return loss at high frequencies approaches 26 db, 
which is just the two-way transmission loss of the line. . 

It is seen from Fig. 4 that there is a 3-db return loss at the cutoff fre- 
quency of the “ideal line.”” However, as the frequency increases, the loss 
increases only gradually; at w = 4wo the return loss is 13.3, db as com- 


| p (28) 
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Fig. 4 — Return loss of a “practical line’’. | 


pared to 17.8 db for the ideal line. Thus, practical limitations on maxi- 
mum shunt capacitance have an appreciable influence on the behavior 
of the line. 


Vv. CONCLUSIONS 


Properties of nonuniform transmission lines have led to the considera- 
tion of a line in which the fractional change in shunt admittance per 
wavelength in the line is constant. The transmission line equations have 
been solved exactly for this case. The solution indicates that a fixed 
length of line s can be made to have as large an effective length as de- 
sired. Hence, with the introduction of a small loss term, all energy 
matched into the line is essentially completely absorbed regardless of 
the line’s termination. 

It has been shown that the line has a long wavelength cutoff given es- 
sentially by \o = 42s. As the frequency increases beyond cutoff fre- 
quency wo, the reflected intensity from the short-circuited line dimin- 
ishes rapidly, being 11.4 db down at 2w» and 17.8 db down at 4wo, as 
seen from Fig. 3. 

If practical considerations limit the maximum shunt capacitance, it is 
necessary to use a larger shunt conductance to obtain the same trans- 
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mission loss. This degrades the performance of the line somewhat, as 
shown by the example in Fig. 4. If a return loss of 10 db is required, it 
is seen from Fig. 4 that the line must be operated at frequencies above 
3.1 wo. Thus, the length of the line would be s = \o/4a = 0.25, where 
d is the longest wavelength for which the return loss would be equal or 
greater than 10 db. This length is one-third of that which would be re- 
quired with the equivalent exponential line. 

The “ideal line’, as indicated by Fig. 3, gives a 10-db return loss at 
1.75 wo. The length of line required for a 10-db absorption would then 
be 0.14\. The differences between the ideal structure and the practical 
example become even more pronounced as greater absorption is required. 
However, practical structures may approach the performance of the 
ideal line if one considers a variation of the loss term ¢ in addition to the 
variation C(x). | 

The nonuniform transmission line analyzed here may be considered 
to be a singularity of the general Bessel line. It is of analytic interest be- 
cause the solutions are in the form of readily interpretable elementary 
functions. It is also of physical interest because the particular varia- 
tion of line parameters is suggested by a common approximation pro- 
cedure for analyzing nonuniform transmission lines, and because the 
exact solutions indicate that the line has desirable properties as a broad- 
band termination. 
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Using Contact Resistance to Measure 
Adsorption of Gases on Metals 


By P. KISLIUK 
(Manuscript received December 5, 1957) 


The contact resistance between electrodes of molybdenum and tungsten is 
used to investigate the adsorption of nitrogen and oxygen. It 1s found that, 
on clean metals, the initial period of fast adsorption gives rise to a surface 
resistivity of about 4 X 10-'© ohm-cm?, corresponding to a layer of the 
order of 1.6 Angstroms thick on each surface. To remove, within a minute 
or so, the layers resulting from adsorption beyond this initial period, tem- 
peratures near 1700°K are required for all four metal-gas combinations. 

Some of the oxygen adsorbed at a pressure of a few torr* at room tempera- 
ture can be removed by lowering the pressure, but such an effect was not 
observed for nitrogen. Nitrogen or oxygen adsorbed during the initial period 
of fast adsorption on clean molybdenum or tungsten at a temperature of 
100°K is not removed in times of the order of one hour by lowering the 
pressure, in disagreement with some results in the literature. The rate of 
migration of adsorbed oxygen into and out of the region between closed 
contacts is found to be much lower than that reported in an earlier experi- 
ment of the same type. 


I. INTRODUCTION 


Measurement of the contact resistance between metal electrodes can 
be used to study surface films of nonconducting substances such as ad- 
sorbed gases. In comparison with other techniques it has the following 
advantages: 

1. It can be used in the presence of the gas. 

2. It can be used over a wide temperature range. 

3. It does not depend primarily on the change in work function, 
which might be small for some adsorbates. 

* Throughout this paper the unit of pressure will be the “‘torr’’, a unit widely 
used in Europe to designate one millimeter of mercury of pressure, i.e., 1.8382 X 
10? dynes/cm?. (See, for example, W. H. Westphal, Phystkalisches Wérterbuch, 


Springer Verlag, Berlin, 1952). This avoids the awkward ‘“‘millimeters of mercury 
of gas X.” 


925 
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-4, Under many conditions it does not destroy the surface film. 

5. Adsorption can be studied on the bulk metal, which can be degassed 
by vigorous heating; i.e., dispersed conditions such as powders or 
porous films are not required. 

In general, the contact resistance arises from two effects, the constric- 
tion of the lines of current flow into a relatively small region of contact 
and the resistance offered by a surface film. The pioneering paper in 
this field is due to Holm and Meissner,! who resorted to low tempera- 
tures to minimize the constriction resistance and showed that, for rea- 
sonably clean metals, there remained a resistance due to the surface 
film of the order of 5 X 107° ohm-cm?. Although the method has been 
applied by a number of authors under a wide range of conditions,’ * '° 
only J. J. Went" attempted to obtain clean surfaces under high vacuum 
and then to investigate the adsorption of gases under controlled condi- 
tions. The present investigation is an attempt to extend these measure- 
ments using the improved vacua now available, and to interpret the 
results in terms of the advances in the knowledge of adsorption gained 
since that paper was published. 


II. PRINCIPLES OF THE METHOD 


For a single circular contact area of diameter, d, much smaller than 
the dimensions of the electrodes, the “constriction” or “‘spreading”’ 
resistance is given by (Ref. 2, p. 16): 


R, = p/d, (1) 


where p is the bulk resistivity of the metal. 
A surface film gives rise to an additional resistance inversely propor- 
tional to the contact area (Ref. 2, p. 79), A: 


Ry = 01/A, (2) 


where a; is called the ‘surface resistivity.” 

Although R, is not completely independent of A, because of the re- 
arrangement of the lines of current flow, the change is negligible for our 
purposes (Ref. 2, p. 18), and we may write: 


Since FR, is proportional to 1/d and R; is proportional to 1/d?, smaller 
diameters of the contact area lead to an increase in the ratio R,/R,. 
Thus, for sufficiently small contact forces and hard metals it is not 
necessary to go to extremely low temperatures to measure Ff. In this 
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manner Went! was able to perform some of his experiments on molyb- 
denum at room temperature. 

The theory accounting for the resistance R&; for films less than about 
20 Angstroms thick, valid for small potential drops (less than a few 
tenths of a volt), is based on a model in which the adsorbed gas is re- 
garded as simply a spacer holding the metal contacts apart. The elec- 
trons flow by means of quantum mechanical penetration of the potential 
barrier. The result of calculations based on the potential barrier illus- 
trated in Fig. 1, is:?1218-14 


hl (# ) 3 
= exp {| — V2 statohm-em 
Ot 2 i p a mp 


3 X 10°" (I/+/e) exp (1y/¢) ohm-cm’, 


where h, e, and m have their usual significance, / is the thickness of the 
film and ¢ the work function of the metal. The second form of the ex- 
pression is valid for ¢ in volts and J in Angstrom units. 

This resistance is ohmic (i.e., current proportional to voltage) accord- 
ing to both theory and experiment.!: ? A further refinement involves the 
inclusion of the electron image forces,” “4 and the result of this calcula- 
tion is plotted in Fig. 2. It may be seen from (4) that the exponential 
factor is dominant so long as l+/g is greater than one and that, under 
this condition, the resistance is extremely sensitive to / and slightly less 
sensitive to y. This is the case for all practically attainable values of 1 
and ¢. 

To interpret the data one must have some knowledge of the dimen- 
sions of the contact area. For perfectly smooth electrodes under most 
of the experimental conditions, the contact force could, in principle, be 
supported elastically. Because of the roughness of practical surfaces, 





(4) 


I 


I 


Fig. 1 — Potential diagram illustrating the model used in the derivation of 
equation (4). 
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Fig. 2 — Contact resistivity, o, , versus film thickness for various values of the 
work function. The electron image force has been taken into account, modifying 
the potential barrier illustrated in Fig. 1. 


however, the load is in fact supported by squashing a small contact 
region inelastically. Under this condition the contact area can be calcu- 
lated approximately if the hardness of the metal is known: 


where P is the contact load in grams and H ig the hardness on the 
Vickers scale. 


III. EXPERIMENTAL APPARATUS 


The apparatus used in these experiments (Fig. 3) consisted essentially 
of two mechanically polished crossed wires (diameter 15 X 10-3 in.) 
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mounted on flat molybdenum springs. The lower of these was connected 
to a quartz yoke which could be pulled up by a micrometer screw operat- 
ing through a sylphon bellows. The force constant of the spring on which 
the upper wire was mounted was measured both before mounting and 
after dismantling, and the constancy of the force constant as the spring 
was cycled between room and liquid nitrogen temperature was verified 
in a separate experiment. The motion of the upper spring as a function 
of the micrometer setting was calibrated with a cathetometer. 

The tube was mounted in a vacuum system (mercury diffusion pump 
and liquid nitrogen trap) capable of reducing pressure to the order of 
10-° torr as measured on an ionization gauge. A gas-handling manifold 
was provided and isolated from the system by a Westinghouse-type 





























Fig. 3 — The tube in which these experiments were performed. 
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vacuum valve. The gases were of spectroscopic grade supplied by the 
Air Reduction Company. The test wires could be flashed by conduction. 

The contact resistance was measured in the region near 50 milliam- 
peres and one millivolt by a four-terminal method using a milliammeter 
and microvoltmeter, exactly as described by Went." 


IV. EXPERIMENTAL RESULTS 


Experiments were performed on tungsten and molybdenum electrodes 
using nitrogen and oxygen as adsorbates. The behavior of the two metals 
is very similar, and that of the two gases differs only at relatively high 
pressures. The experiments are subdivided as follows: 

1. The behavior of clean metals. 

2. The resistance due to gas adsorbed at room temperature. 

3. The temperature of flash necessary to remove the adsorbed film in 
a reasonable time. 

4, Adsorption at lower temperature. 

5. The creeping in and out of adsorbed gas between closed contacts. 

One is limited in these experiments by the scatter in the data. In a 
run of ten closures at the same force the largest value of the resistance 
was usually about twice the smallest, and the average deviation from the 
mean about 20 per cent. After the electrodes had been exposed to the 
same conditions of gas pressure for equal intervals at widely different 
times during the course of the experiment even the average of ten re- 
sistance measurements sometimes varied by as much as a factor of two. 
This scatter is partly due to differences in local hardness because of the 
polycrystalline nature of the wires, but also could be caused by differ- 
ences in annealing, as the wire was flashed and then work-hardened in 
the contact region. It may also be due to local differences in adsorptive 
properties. In any case, as a consequence little weight can be attached 
to small differences and only fairly gross effects can be interpreted 
reliably. . 

Some of the relevant physical properties of tungsten and molybdenum 
are given in Table I. 


4.1 Clean Surfaces 


The metal surfaces were presumed to be clean after having been 
flashed repeatedly at a temperature of 2000°K for molybdenum, and 
at a somewhat higher temperature for tungsten, in a measured residual 
gas pressure of about 2 X 10-'° torr. By measuring the increase in the 
maximum pressure during a flash as a function of the cold interval, it 
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TaBLE I — PrRoprErtTIES OF MOLYBDENUM AND TUNGSTEN 














Present Work* Other References 
Resistivity, p, 300°K. Mo — 5.9 XX 10-* ohm-cmf 
W — 5.85 X 10-§ ohm-cmt 
; estes p300°K Mo 10.2 9.3t 
Ratio of Resistivity, 
p738°K WwW 9.3 8.4} 
Hardness, H, 300°K Mo 230 160-2258 
W 480 290-488§ 
A H300°K —_— 
Ratio of Hardness, os 7 eae = 








* Measured resistivity ratios are for wires after outgassing in high vacuum. 
The hardnesses were measured on larger pieces from different samples. In the 
case of tungsten, the hardness samples had also been vacuum annealed, but the 
molybdenum hardness sample was not so treated. 

+ Ref. 16, pp. 2357-2363 

t Refs. 17 and 18 

§ Ref. 19, pp. 1140, 1144 


was verified that about five hours were required to reach a constant 
value. The experiments to be reported in this section were performed 
within a period less than one-half hour after the flash. 

For a force of 1 gm (+5 per cent) the contact resistance averaged 
between 10 and 16 milliohms for molybdenum and between 14.5 and 
28 milliohms for tungsten. This range of values is for different samples 
and for measurements taken at different times during the course of 
these experiments. If the values of the hardness and resistivity listed 
in Table I are used in equations (1) and (5), one computes 22 and 36 
milliohms for the two metals. Considering that the load is supported 
elastically to some extent and that the contact region may not be exactly 
circular, this agreement is satisfactory. It is assumed in latter portions 
of this paper that contact areas obtained from the measured resistivities 
under these conditions are more reliable than those computed from the 
hardness. 

The contact resistance between clean metal electrodes, according to 
(1) and (5), should vary as the inverse square root of the force, and it 
does so within the limits of experimental error, as can be seen in one of 
the curves of Fig. 4.* However, to obtain this result it is necessary to 

* For the opposite extreme, when R,> FR, rather than R;, > R; , one would ex- 
pect #& to vary as the inverse first power of the force if the load is supported by 
plastic deformation. The other curve of Fig. 4, obtained for the thickest film ob- 


served during the course of these experiments, shows that this is also verified ex- 
perimentally. 


932 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 











W 
3° 
[o) 
[eo] 








N 
ES 
[2] 
oO 








AFTER EXPOSURE TO 6MM Hg 
OF Oz FOR 24 HOURS 


)- 





1800 





tw 
(o) 
jo) 





RESISTANCE IN MILLIOHMS (=——) 


RESISTANCE IN MILLIOHMS ( 








600 


1/4 1/2 1 2 
FORCE IN GRAMS 


Fig. 4 — Contact resistance versus contact force for clean tungsten electrodes 
and for electrodes which have been exposed to 6.5 torr of oxygen for 24 hours. Each 
point is the average for five closures. 


increase the force to its final value and not to overshoot and come back. 
The effect of cycling is shown in Fig. 5. There is also a relatively slow 
increase in contact area with time and a typical curve of the resulting 
decrease in resistance is shown in Fig. 6. 

The assumption that the load is supported on regions formed, for 
the most part, by plastic flow is thus confirmed by the following obser- 
vations: 

1. The approximate agreement of the measured value of R, with that 
computed on this assumption. The contact resistances computed for an 
elastically supported load are considerably smaller than those found 
experimentally. 

2. The slopes in Fig. 4, which are 0.5 clean and 1.0 with a thick film, 
as expected for plastic deformation, rather than 0.3 and 0.7, as would 
be expected for purely elastic deformation. 
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Fig. 5 — The contact resistance of clean tungsten with increasing and decreas- 
ing contact force. The numbers on the experimental points give the sequence of 
the measurements. 


3. The irreversibility exhibited in Fig. 5. 

4, The slow changes shown in Fig. 6. 

The second point also shows that any surface film still present on the 
nominally clean surface contributes very little to the measured resist- 
ance. 

The results of attempts to check the effect of lowering the tempera- 
ture on the contact resistance of clean metals were inconclusive because: 

1. The long time (about one-half hour) required for the contacts to 
cool down to liquid nitrogen temperature after flashing made the vacuum 
requirements much more severe. 

2. Inadequate radiation shielding made the actual temperature of the 
contacts uncertain. 
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Fig. 6— The contact resistance between closed clean tungsten contacts (force = 
1 gm) as a function of time. The tube was jostled slightly by passing footsteps at 
the time indicated. 
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For both metals, the contact resistances measured when the contacts 
were closed with the tube immersed in a liquid nitrogen bath were lower 
by a factor of about four than the contact resistances measured at 
room temperature with the same contact force. The resistivity and hard- 
ness ratios of Table I, however, would lead one to expect a factor of 
about seven.* 

In summary, the results of these experiments are consistent with the 
behavior expected for clean metals plastically deformed by the contact 
load. The contact areas for a force of 1 gm at room temperature are 
about 2 X 10-7 cm? for molybdenum and 5 X 10-° cm? for tungsten. 


4.2 The Contact Resistance Due to Gases Adsorbed at Room Temperature 


A series of experiments in which the contact resistance was measured 
after tungsten contacts were exposed to nitrogen and oxygen at various 
pressures and for various intervals is reported in Appendices A and B 
and summarized in Figs. 11 and 12. The gas was usually admitted by 
opening a metal valve very slightly and allowing the gas to leak in while 
the system was continuously being pumped. Because of the pumping 
action of the ion gauge and the glass walls, which decreased as they 
became saturated, the pressure showed a steady increase with time. 
However, for nitrogen, except at the lowest pressures, the effect was 
small, so that the rate of change in pressure could be ignored in comput- 
ing the pressure-time product, here called the ‘‘exposure’’. For oxygen, 
on the other hand, the effect was quite large and had to be taken into 
account. The corresponding desorption during the interval of pumping 
out the gas resulted in higher pressures after a given time with oxygen 
than with nitrogen. 

To interpret these results, it is helpful to review some of the results 
of measurements obtained by other methods. When nitrogen (and per- 
haps oxygen) molecules strike a clean molybdenum or tungsten surface 
at room temperature, the sticking probability, s, is relatively constant 
and of the order of tenths until a coverage of roughly 1.7 K 10" mole- 
cules per cm? is attained, at which point it falls off rather sharply to a 
much lower value.2® 2!) 2 Thus, after a few seconds at 10~® torr or a few 
minutes at 10° torr, this coverage is attained, after which the coverage 
changes relatively slowly. The sticking probability continues to fall off 
with increasing coverage until, after a considerably longer time (of the 
order of 15 minutes for nitrogen at 10° torr, when the coverage is above 
3.5 X 10 molecules per cm?), the sticking probability remains relatively 
constant at a value of roughly 10-*. When the coverage reaches approxi- 


* Went"! reports a factor of only 2.3. 
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mately 5.5 X 10" molecules per cm?, the sticking probability for ni- 
trogen falls off to an immeasurably small value.”! The “exposure”’ for 
the first step of rapid adsorption is, for nitrogen, approximately 3 X 107% 
torr-min, and exposure needed for a relatively constant low value of s 
is approximately 10-° torr-min. To attain the immeasurably low stick- 
ing probability the exposure is roughly 10-* torr-min.* 

The results given in Appendix A and Fig. 11 for nitrogen on tungsten 
may be summarized as follows: Exposures corresponding to the comple- 
tion of the critical period of rapid chemisorption give rise to a surface 
resistivity of 3.5 X 10-!° ohm-cm?, while exposures corresponding to 
the attainment of immeasurably low sticking probability give rise to a 
surface resistivity of 12 X 10-!° ohm-cm?. Higher values of the surface 
resistivity observed for much greater exposures may be due to impurities 
in the gas. No appreciable part of the adsorbed layer giving rise to the 
observed values of co; could be removed at room temperature by 
pumping out the nitrogen. 

The results given in Appendix B and Fig. 12 for oxygen on tungsten 
yield a; = 5 X 107!° ohm-cm? at the completion of the initial period of 
rapid absorption, and o; © 10 X 107!° ohm-cm? for exposures roughly 
ten times as great. However, at pressures above 10~° torr the resistance 
continues to increase at a measurable rate up to the longest times ob- 
served (24 hours) and eventually exceeds any of the values observed 
for nitrogen on tungsten. This is attributed to the formation of one of 
the stable oxides of tungsten (perhaps WO;). For the growth of such 
films a logarithmic law, the ‘‘Elovich” equation, has been widely ob- 
served,” but no attempt to follow this growth quantitatively will be 
made in the present paper. For the thickest film observed, resulting from 
the exposure to 6 torr of oxygen for 24 hours (Fig. 4), o; was equal to 
6 X 10-8 ohm-cm’. 

When the pressure has been as high as several torr of oxygen, loosely 
held molecules on the oxide film contribute to the contact resistance. 
These can, however, be pumped off at room temperature. 

While the data on molybdenum are less extensive, the values of o; 
for the first and second steps of adsorption on the two metals are simi- 
lar, as is the behavior of oxygen at a pressure of a few torr. 


4.3 Temperature of Flash Necessary to Remove the Adsorbed Film 


In these experiments, the contacts were exposed to a few torr of the 
gas in question for a few minutes, and the system was then pumped 


* The sticking probability may be a function of pressure at high pressure; 1.e., 
processes other than the arrival rate may control the rate of reaction. 
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Fig. 7 — The contact resistance at room temperature (force = 1 gm) after heat- 
ing for 30 sec with various currents for molybdenum and tungsten contacts which 
have been exposed to a few torr of oxygen or nitrogen. 


down to a pressure of about 10-° torr before the flashing procedure* was 
started. 

The heating current was maintained for 30 seconds, after which 5 
minutes were allowed for cooling. The results are given in Fig. 7, where 
each point is the average for five closings. 

Flashing at about 1000°K causes some lowering of the contact resist- 
ance for oxygen on either metal, whereas for nitrogen there is no ob- 
servable effect below 1450°K on molybdenum, and below a somewhat 
higher temperature on tungsten. For all four combinations of gas and 
metal there is a sharp drop in resistance after flashing at 6.5 amperes 
(1600 + 50°K for molybdenum, 1750 + 100°K for tungsten) and the 
curve is essentially flat above about 1725°K for molybdenum, and 
above 1800°K for tungsten. 

The molybdenum temperatures were measured with an optical pyrom- 
eter and suitably corrected for the emissivity and reflection in the glass. 


* Even at this lower pressure, a layer corresponding to the fast adsorption 
(first step) can be adsorbed in the few minutes necessary for the electrodes to cool, 
so that it was not possible to determine whether the metal had been completely 
cleaned, but only whether the thicker layers had been flashed off. 
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Unfortunately, the wall of the tube in the tungsten experiment had been 
coated with evaporated metal before such a temperature measurement 
was taken, so that these temperatures are taken from the tables of 
Jones and Langmuir,” with an estimated correction for cooling at the 
ends. ; 

The data for oxygen on molybdenum are consistent with those of 
Went,!! who finds some reduction in resistance after heating to 1000°KK 
for 5 minutes and a more marked drop after heating to 1750°K. A rise 
in resistance observed by Went at 1400°K was not observed in the pres- 
ent experiment. 

For oxygen on tungsten the results of field emission microscope experi- 
ments are roughly consistent with the present experiments. These re- 
sults have been interpreted to be due to these successive actions: partial 
desorption from some crystal planes in one minute at 800°K; a tempera- 
ture region (800 to 1400°K) where there is no further desorption; a 
resumption of desorption (perhaps of WQO3;) from all planes in the region 
from 1400 to 1700°K.?:5 

The observation of some relatively loosely bound oxygen is also in 
agreement with Morrison and Roberts,?*”" who suggest that these mole- 
cules are adsorbed on isolated empty sites for the chemisorption of an 
atom, where the molecule is prevented from dissociating by the lack of 
an adjacent empty site. 

For nitrogen on tungsten the present result differs from that of Becker 
and Hartman,”. in that we do not observe a decrease in contact resist- 
ance after 30 seconds at temperatures in the neighborhood of 1100°K 
even when, in their notation, 0 ~ 2 (second step of adsorption complete), 
under which condition they observe a measurable rate of desorption. 
However, there is rough agreement as to the temperature necessary to 
clean the metal surface. . 


4.4 Adsorption at Lower Temperature 


Went"! and de Boer and Kraak? present evidence that oxygen is only 
physisorbed and may be pumped off when it is admitted to a nominally 
clean molybdenum surface at a temperature near 100°K. On the other 
hand, Gomer®*° has shown that, even at 40°K, oxygen is rapidly 
chemisorbed on a clean tungsten surface. One is tempted to conclude 
that the vacuum techniques at the time of the experiments of Went and 
de Boer and Kraak were not adequate to maintain a clean surface during 
preparations necessary before oxygen was admitted (cooling down after 
flashing, in the case of Went, and pre-sintering the evaporated film, in 
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the case of de Boer and Kraak). The reversible adsorption they observe 
may thus apply to surfaces already covered with chemisorbed atoms. 
The arguments presented by Went to show that his surface was essen- 
tially clean can be shown to be inconclusive. One possible exception may 
be the high temperature required to recover his “‘clean’”’ state, but the 
evidence that the surfaces remained clean while cooling down to liquid 
oxygen temperature is unconvincing. 

The present experiments were performed in a liquid nitrogen bath 
for oxygen on tungsten, nitrogen on tungsten and nitrogen on molyb- 
denum. The results of these experiments are tabulated in Appendix C. 
In no case was the initial state of cleanliness recovered by pumping at 
the low temperature. In the case of nitrogen on tungsten, there was an 
increase in resistance during the pumping interval, possibly due to the 
accumulation of impurity gases on the surface. 

The temperature of the contacts in these experiments was somewhat 
above that of the liquid nitrogen bath, because of inadequate radiation 
shielding. With all exposed parts of the tube packed with dry ice to 
reduce thermal radiation, the resistance of one of the tungsten wires 
was used as a thermometer. Using this procedure, the temperature in 
the above experiments was estimated to be 95 + 10°K. 

In summary, the bulk of the nitrogen and oxygen taken up during 
the initial period of fast adsorption at 95°K cannot be removed in a 
few hours by merely reducing the gas pressure to a much lower value. 
This is in disagreement with some early results,!'° but is in agreement 
with more recent experiments.”:?° 


4.5 The Creeping in and out of Adsorbed Gas Between Closed Contacts 


Went (Ref. 11, Fig. 10) reports that physisorbed layers of oxygen at 
room temperature can migrate in and out of the region between contacts 
of molybdenum. He presents arguments to show that this is possible. 
In the present experiments the observed effects were smaller by at least 
one order of magnitude. 

After the sequence of measurements of Fig. 6, where the fall in contact 
resistance with time on clean metal is attributed to a slow increase in 
area, 5 torr of oxygen were admitted to the closed tungsten contacts. 
The contact resistance as a function of time is shown in Fig. 8(a). The 
increase in resistance in 100 minutes is small and occurs in a single jump, 
and so may be due to a jarring of the apparatus. 

If the contacts are closed in the presence of the gas, the resistance 
between the contacts decreases with time [Fig. 8(b)]. This is not neces- 
sarily due to the squeezing out of gas, as it is of the order to be expected 
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Fig. 8 — (a) The resistance between closed tungsten contacts (force = 1 gm) as 
a function of time when 5 torr of oxygen are admitted at t = 0. This sequence ac- 
tually follows that of Fig. 6. While no actual disturbance was noted at the time 
marked ‘‘jostled?”’ the flatness of the curve for earlier and later times indicates 
that this is very likely the reason for the sudden small increase in resistance during 
this interval. (b) The resistance between closed tungsten contacts as a function of 
time after exposure to 6.5 torr of oxygen. At the point marked ‘‘A’”’ on one of the 
curves the gas was pumped out without disturbing the closed contacts. 


because of slowly increasing area, judging from the rate of change of 
the resistance between clean contacts (Fig. 6). If the gas is pumped out 
during such a sequence, a small discontinuity sometimes results [Fig. 
8(b)] which may be due to gas molecules migrating out of a small region 
near the edge of the contact area. 

A similar sequence for oxygen on molybdenum is shown in Fig. 9. 
However, this is for a somewhat lower pressure and smaller time of ex- 
posure before closing than that used by Went! (2.7 torr rather than 10 
torr and two hours rather than ten hours). 


V. INTERPRETATION AND DISCUSSION 


The experiments on clean metal (Section 4.1) show that the assump- 
tions leading to equations (1), (2), (3) and (5) are approximately valid, 
and give a means for measuring the contact area. 

To interpret the data on films adsorbed at room temperature (Section 
4.2) by means of (4) it is necessary to assign a value to the work func- 
tion, g. The changes in contact potential during the adsorption of ni- 
trogen or oxygen show that the work function is increased,”!: *! but in 
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Fig. 9 — The resistance between closed, initially clean, molybdenum contacts 
(force = 1 gm) as a function of time at room temperature. At point A, 2.7 torr of 
oxygen were admitted to the system. Point B is the average resistance for ten 
closings in the presence of the gas. Beginning at point C the contacts were again 
left closed. At point D the gas was pumped out. 


the presence of such an adsorbed film the potential diagram of Tig. 1 
is an oversimplification and the values of 1 obtained by the use of (4) 
can have only an approximate meaning. Furthermore, the film may not 
be uniform over the contact area, in which case the measured values are 
averages heavily weighted toward a value appropriate to the thinnest 
film within the contact area. The values of | according to (4), using the 
data of Section 4.2, are given in Table II. Here, ¢ = 5.0 volts for ni- 
trogen on tungsten; g = 6.0 volts for oxygen on tungsten. 

For the thickness of the layer on a single surface these values of | 
should be approximately halved. Thus, the entire range of thickness of 
the layers on each surface in these experiments is from 1.5 to 2.6 Ang- 
stroms, 1.e., one or two atom diameters in thickness. The interpretation 
of this distance on an atomic model, however, offers considerable diffi- 


Tasie II — TuickNess oF ABSORBED FILM oN TUNGSTEN 


Surface Resistivity Due to Film, ez, Thickness of Film, J, 
ohm-cm?2 Angstrom Units 
Coverage, Molecules/cm? 
Nitrogen Oxygen Nitrogen Oxygen 
1.7 X 1014 3.5 X 10719 5 X 1071° 3.3 3.1 
3.4 X 1014 1.2 X 1079 1X 1079 4.0 3.4 
More than 4 X 10" ~5 X 1079 up to 6 X 10-8 4.5 up to 5.1 
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culty, as is illustrated diagrammatically in Fig. 10. This difficulty is 
connected with the problem of where to place the potential barrier (as 
illustrated in Fig. 1) with respect to the positions of the atoms in the 
surface plane®. 

In Section 4.3 the range of temperatures necessary to remove these 
films from the surface is found to be in general agreement with that 
.determined by other techniques. It is perhaps surprising how similar 
the behaviors of nitrogen on tungsten, oxygen on tungsten, nitrogen on 
molybdenum and oxygen on molybdenum are to one another: All require 
a temperature in the vicinity of 1700°K to remove a large fraction of 
the adsorbed gas within about 30 seconds. The heat of adsorption of the 
desorbing molecules is thus of the order of five electron volts.* 

In Section 4.2 it was shown that some of the oxygen giving rise to the 
measured resistance after relatively great exposures could be pumped 
off at room temperature. This was not the case for nitrogen. The result 
reported in Section 4.3— that temperatures sufficient to remove a 
considerable amount of oxygen had no observable effect on the nitrogen 
layers — confirms the conclusion that additional oxygen is bound, 
though with relatively weak bonds, on top of the more firmly held lay- 
ers.?6,27,34 Nitrogen, on the other hand, shows little tendency to stick 
to the completed nitrogen layers at room temperature, although some 


TUNGSTEN 
ae ATOM 








\ 
CHEMISORBED 
OXYGEN ATOM 


Fig. 10 — A conceivable arrangement of atoms when two (100) faces of a body 
centered cubic lattice are opposed, with chemisorbed gas atoms on each surface in 
one quarter of the lattice sites. If the coverage were twice as great, the opposing 
lattice planes would not necessarily be farther apart — the alternate empty posi- 
tions could simply be filled in. The relative radii are approximately those appro- 
priate to oxygen on tungsten. The distance, x, between the nearest opposed (100) 
planes is 1.75 times that in the normal lattice, H, i.e., about 2.8 A. The distance 
! to be used in (4) should perhaps lie between this value and X — H, the increase 
over the normal distance, which is about 1.2 A. 
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of the gas adsorbed at liquid nitrogen temperature is released as the 
contacts are allowed to warm up (Appendix C, Table VII). 

The differences between the results of Section 4.4 on adsorption at 
95°K and those reported by Went! are unexplained. 

In Section 4.5 the effects of migration of adsorbed molecules between 
closed contacts are found to be much smaller than those observed by 
Went (Ref. 11, Fig. 10). This might be due to low-frequency mechanical 
vibration in his experiments resulting from footsteps etc., which may 
have rocked the contacts, allowing different regions to be exposed.’ Ex- 
tensive precautions against vibrations were necessary to obtain the 
results presented in Figs. 6, 8 and 9. 
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APPENDIX A 


We interpret the values of o; for the entries in Table III numbered 2 
through 6 and 12 and 13, for exposures between 7 X 10-8 and 6 X& 10-5 
torr-min as due to a layer corresponding approximately to the completion 
of the first step of rapid chemisorption. The average value of o; is 
3.5 X 107!° ohm-cm? (£30 per cent). Entries numbered 7, 8, 15 through 
21 and 23 through 27, for exposures between 1.7 X 1074 and 8 X 107! 
torr-min, we attribute to a layer somewhere in the second or third steps 
of slower chemisorption. The value of o; averages 1.2 & 10-® ohm-cm? 
(+25 per cent). 

The higher values of o; in entries 28 through 34 corresponding to ex- 
posures from 1.0 to 85 torr-min might possibly be attributed to an im- 
purity in the gas. Oxygen, if present in a few parts in 105, could account 
for such an effect, while spectroscopically unobservable impurities might 
be present in concentrations of the order of a few parts in 10*. The high- 
est value of o; observed in the presence of nominally pure nitrogen is 
~5 X 10-° ohm-cm”. 

Entries 9, 10, 20, 21, 33 and 34 show that none of the adsorbed gas 


TABLE III — Tur ApsorpPTion or NITROGEN ON TUNGSTEN AT RooM 
TEMPERATURE (Fig. 11) 


23. 
24. 
. Wait one hour 
26. 
27. 
28. 


29. 
30. 


31. 
. Wait 20 min 
33. 


34. 


C© CO No OUR co bo 





Treatment 


. Flash repeatedly; allow to cool for 


10 min in_ residual 
1.1 & 10°-!° torr 


pressure 


. Admit 7 X 107° torr of Ne ; wait 10 


min 


. Wait 35 min 


Wait 35 min 


. Admit 2 10-7 torr of Ne ; wait 5 


min 
Wait 50 min 


. Admit 8 X 1075 torr of Ne ; wait 5 


min 


. Wait 30 min 
. Close leak from Ne source; pump 


for 5 min, pressure down to 


5 X 10°77 torr 


. Pump for 16 hours, pressure down 


to 8 X 10-9 torr 


. Flash repeatedly, pressure falls to 


1 X 10-9; wait 10 min 


. Admit 4.5 X 10-¢ torr of Ne ; wait 3 


min 


. Wait 10 min 
. Wait 10 min 
. Wait 15 min 
. Wait 10 min 
. Wait 15 min 
. Wait 40 min 
. Wait 50 min 
. Close leak from source; pump for 5 


min, pressure down to 1X 10-7 torr 


; Pump for 16 hours, pressure down 


to 3 X 107° torr 


. Flash repeatedly, residual pres- 


ake down to 1 X 107° torr; wait 10 


Admit 4X 10-3 torr of Nz ; wait 5 
min 
Wait one hour 


acu 1.4 X 10-? torr of No ; 3 wait 5 
Admit 3 X 10-2 torr of Ne ; wait 5 


min 
Admit 1.5 X 107! torr of Ne ; wait 2 
min 

Wait 30 min 

Admit 9 X 107 torr of Ne ; wait 5 
min 

Admit 3.3 torr of Ne ; wait 3 min 


Open valve to pumps; in 5 min, 
pressure down to 3 X 10-7 torr 
Wait 30 min, pressure down to 
2.5 X 108 torr 


Contact 
Resistance 
(Milliohms) 

Average of Five 

Closures 

Force = 1 gm 


17 


22 


21 
20 
20 


21 
30 


37 
36 


ot (ohm-cm?2) 


0, assumed 
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Total Exposure 
of Electrodes to 
Gas Since 
Previous Flash 
(torr-min) 


o 


10-8 
1077 
1077 
10-6 


10-5 
10-4 


10% 


mon 
tn oreo ~“ 
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1.3 X 107° 
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OPWNNOSD 
| XXXXXxX 


2X 10° 
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5 X 1071 

6 X 107! 
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1.0 
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20 
85 


DATUM NUMBER 
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RESISTANCE 
mone PRESSURE 


PUMP OUT 


CONTACT RESISTANCE IN MILLIOHMS 


AFTER TIME INTERVAL IN MINUTES 
CLEANING 
BY FLASHING 


Fig. 11 — Nitrogen on tungsten, at room temperature. 
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Fig. 12 — Oxygen on tungsten, at room temperature. 
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TaBLE IV — Tuer ADSORPTION OF OXYGEN ON TUNGSTEN AT RooM 
TEMPERATURE (Fig. 12) 


10. 
11 
12. 


13. 


14. 
15. 
16. 
17. 
18. 
19. 
20. 
21. 








Treatment 


Contact 
Resistance 
(Milliohms) | 

Average of Five 

Closures 

Force = 1 gm 


ot (ohm-cm?2) 





. Flash repeatedly in residual 


pressure 2 X 107! torr; wait 
10 min 


. Flash in presence of 1 X 10-9 


torr of O2 ; wait 10 min, during 
which pressure rises to 2.5 X 
10~° torr 


. Wait 20 min, during which 


pressure rises to 3.56 X 1078 
torr of O2 


. Wait 30 min, pressure up to 


6.5 X 10-8 torr 


. Wait 2 hours, pressure up to 


2.7 X 10°77 torr 


. Reduce pumping speed, pres- 


sure rises to 2 X 10° torr; 
wait 5 min 


. Wait 1 hour, pressure up to 


5 X 10-8 torr 


. Wait 20 min, pressure up to 


6 X 107° torr 


. Close valve to O2 source; in- 


crease pump speed; wait 10 
min, pressure down to 5 X 1078 
torr 

Pump for 80 min, pressure 
down to 1.5 X 10-8 torr 

Pump for 16 hours; pressure 
down to 2 X 107° torr 

Flash repeatedly, pressure 
falls to 1.8 X 1079 torr; wait 
10 min 

Admit O2 to closed system at 
~2 X 10-5 torr; wait 30 min, 
pressure falls to 5 X 1076 torr 
Wait two hours, pressure 
down to 1.8 X 10-6 torr 

Wait 30 min, pressure down to 
1.6 X 10-6 torr 

Admit O2 to pressure 9 X 10-6 
torr; wait 10 min 

Admit O2 to pressure 3 X 1075 
torr; wait 5 min 

Wait 30 min, pressure 4 X 1075 
torr 

Wait 40 min, pressure 5 X 1075 
torr 

Wait one hour, pressure 5 X 
107° torr 

Increase pressure to 3 X 10~4 
torr, wait 15 min 


20 0 

23 2X 107!° 
29 6 X 107?° 
30 7X 107?° 
33.5 9.5 X 10°10 
33 9.9 X 10-19 
34.5 1.0 X 10°° 
37 1.2 X 10° 
35 1.1 X 10-° 
35 1.1 X 107° 
25 3 X 107!° 
28 6 X 107!° 
33 9 X 1071° 
35 1.1 X 10° 
39 1.3 X 10-9 
46 1.8 X 107° 
61 2.8 X 107° 
55.5 2.5 X 107° 
74.5 3.8 X 10-9 
127 7.5 X 10-° 


20 0, assumed 


Total 
Exposure of 
Electrodes to 
Gas Since 
Previous Flash 
(torr-min) 


0 


2X 10° 


4X 107% 


2X 10° 
2X 1075 
3 X 1075 


2X 104 
3.2 X 10-4 


1.3 X 10% 


~3 X 1074 


~10°3 
~107% 
~10-° 
~107% 


|\~2 & 1073 


~3 X 10% 
~10 
~10 
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TaBLE [IV — Continued 





Contact Total 














Resistance Exposure of 
Trentment pollichms) | os (ohmems) | Rete to 
Closures Previous Flash 
Force = 1 gm (torr-min) 
22. Close O» leak; increase pump- 132 7.7 X 1079 — 
ing speed; wait 7 min, pres- 
sure down to 2.5 X 10-° torr 
23. Wait 25 min, pressure 1.2 X 110 6.3 X 1079 — 
10-6 
24. Pump for 16 hours, pressure 111 6.4 X 107° — 
1.1 X 1077 torr 
25. Admit Oe to pressure 1.4 torr; 219 1.4 X 1078 ~30 
wait 20 min 
26. Wait one hour 240 1.5 X 10-8 100 
27. Increase pressure to 2.1 torr; 215 1.4 X 107° 175 
wait 30 min 
28. Wait 2 hours 835 2.2 X 10-8 400 
29. Open valve to pumps; wait 193 1.2 X 1078 — 
one hour, pressure 5 X 107 
torr 
30. Pump for 114 hours, pressure 175 1.1 X 1078 — 
2.5 X 1077 torr 
31. Pump for 16 hours, pressure 184 1.1 X 10-8 — 
7 X 10-8 torr 
32. one 5.5 torr of Oo; wait 2 354 2.3 X 1078 ~1,000 
ours 
33. Wait 30 min 410 2.7 X 10-8 ~1,000 
(2 unusually (~1.0 X 1077) — 
large readings 
not included in 
average ~1400) 
34. Open valve to pump; wait 2 319 2.0 X 1078 — 
hours, pressure down to 1 X 
1077 torr 











giving rise to the measured resistance pumps off at room temperature 
even after the nitrogen pressure has been as high as 3 torr. 


APPENDIX B 


We attribute entries in Table IV numbered 3 through 6 for exposures 
from 4 X 10-7 to 3 X 10- torr-min to the first step of fast adsorption, 
with o: © 5 X 107” ohm-cm?. Entries 7 through 11, 14 and 15, for ex- 
posures from 2 X 10~* to about 10? torr-min, are attributed to the com- 
pletion of the second step, with o; + 1.0 X 10-° ohm-cm*. For pressures 
of oxygen above ~10-> mm of mercury, the resistance continues to 
increase at an observable rate (entries 16 through 33). 

Entries 28 through 31, 33 and 34 show that, after exposure to a few 
torr, a considerable decrease in resistance occurs on pumping out the gas. 
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APPENDIX C 


Adsorption and Desorption at Low Temperatures 


oN OO PWN Ff 


orn WD 


TABLE V — OxYGEN ON TUNGSTEN 


Contact 
Treatment Resistance 
(Milliohms) 





. Expose to 10~® torr of O2 at room temperature for a few minutes; 


measure resistance with force = 1 gM............ cc cee cee cee ees 


. Close contacts with force = 2 gM... ..... ieee ce ee cece nes 37 
. Apply liquid N2 and wait 30 min with contacts remaining closed.... 17 
. Increase Oz pressure to 1075 torr; allow contacts to stand open 15 
min; measure resistance with force = 1 gm................... 0c eee 140 
LOWalt 2 OMT Rot ras Gates eters aoa ata Cee are wee Re ea nares 160 


. Open valve to pumps, wait 1 hour until pressure is 10~’ torr; measure 


resistance with force = 1 gM............. 2c ccc ccc ttt 160 


. Remove liquid Nz; allow system to warm 10 minutes, pressure = 


2X 1077 torr (force = 1 gm)...... ee tee cence eee 120 


. Pump 16 hours at room temperature, pressure = 10-8 torr (force = 


DRT) cee bt ee eA odes tlh BL thes Roath ts rata Slate a Meet ae at 90 


TaBLE VI — NirroGen on MoutysBpENUM 


Peas: 

esistance 

Treatment (Milliohms) 
Force = 1 gm 


. Flash contacts in liquid Ne bath; allow 30 min to cool, pressure = 


CE NOAA OPP rae. Vek caxcd hte eee AS Vas be Ec RE oad Sa eee ee 6 
Admit Nez, pressure = 5 X 107° torr; wait 10 min.................. 22 
. Open valve to pumps; wait 1 hour, pressure = 1 X 10~° torr.......... 23 
. Admit nitrogen, pressure 6.0 torr; wait 10 min...................... 41 
. Open valve to pumps; wait 10 min, pressure = 10-6 torr............. 41 


TaBLE VII — NirrRoGEN on TUNGSTEN 


peonteet 

esistance 

Treatment (Milliohms) 
Force = 1 gm 


. Flash in liquid Nz bath; wait 30 min, pressure 2 X 107!° torr......... 8 
. Admit 1.0 torr of No; wait 40 min..... 2... cece eee 85 
. Open valve to pumps; wait 5min, pressure down to 3 X 10-* torr.... 124 
. Wait 1 hour, pressure 3 X 1077 torr. . 2.2... 1. cece eee eens 150 


. Remove liquid Nz; wait 15 min to allow system to come to room tem- 


perature, pressure = 107° torr......... 0... cece ee eee eee meeting hens 60 
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Shot Noise in p-n Junction Frequency 
Converters” 


By A. UHLIR, JR. 
(Manuscript received January 22, 1958) 


General equations are derived for the noise figure of a frequency converter 
using a p-n junction diode with arbitrary minority-carrier storage. A p-n 
junction with a purely capacitive nonlinear admittance permits, theoreti- 
cally, noiseless amplification. Structures are suggested for approximating 
this ideal. A diffused silicon junction diode gave 9 db of gain with a 2-db 
noise figure in converting 460 to 9375 me and gave a 3-db noise figure as a 
6000-me negative-resistance amplifier. Nonlinear-resistance diodes cannot 
amplify but can give low-noise frequency conversion if the local-oscillator 
drive produces a sharply pulsed current waveform in the diode. 


I. INTRODUCTION 


The shot noise output of any p-n junction device in any circuit can be 
calculated by enumerating the possible hole and electron motions and 
summing the noise energies delivered to the load. This method of analy- 
sis will be illustrated by applying it to the de-biased p-n junction (Sec- 
tion IX). A more complicated example, and the principle topic of this 
paper, is the general p-n junction frequency converter (Section XI). 

The technological significance of two kinds of semiconductor diodes 
used as frequency converters will be discussed briefly. Section II will 
explain how this theory of shot noise stimulated the recent development 
of low-noise microwave amplifiers using the nonlinear capacitance of 
p-n junctions. Section III mentionsthe nonlinear resistor, for which shot 
noise appears to be a vital problem. 

The ‘“particle-method” formulation of p-n junction theory in terms 
of the statistics of individual carrier motions (Section VI) is an alterna- 
tive to the diffusion equations ordinarily used in the analysis of p-n junc- 
tions. Either approach leads directly to the impedance of the junction 

* Supported in part by the U. 8. Army Signal Corps under contract DA 36-039 
se-5589. Preliminary reports of this work were given at the Microwave Crystal 


Rectifier Symposium, Fort Monmouth, N. J., February 1956, and the I.R.E.- 
A.I.E.E. Semiconductor Device Research Conference, Boulder, Colo., July, 1957. 
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(Sections VII and VIII) and to the small-signal transmission properties 
of frequency converters (Section X). However, the particle method is 
the most direct way to calculate shot noise. 

After the general equations for noise figure are derived in Section XI, 
the consequences for nonlinear capacitors and nonlinear resistors are dis- 
cussed in Sections XII and XIII. One of the virtues of the analysis is 
the possibility of relating impedances and noise without regard to the 
structure of the p-n junction. However, a few examples of nonlinear- 
capacitor structures are given in Section XII. 

As explained in Sections IV and V, the particle method and the diffu- 
sion equation method deal with just one aspect of p-n junction phe- 
nomena: minority-carrier effects, which have varying importance, de- 
pending upon the device and its use. Minority carrier effects are nowhere 
more important than in p-n junction transistors. Thus, the particle 
method is an excellent way to calculate the shot noise in transistors, 
although such a calculation will not be given in this article.* 


II. NONLINEAR-CAPACITANCE AMPLIFIERS 


A time-varying capacitor is an active device that can be used as a 
linear amplifier of small signals, usually, but not necessarily, with a shift 
in signal frequency. The power required for the amplification is supplied 
by the motor, of whatever kind, that varies the capacitance. This prin- 
ciple is used in vibrating-reed electrometers for amplifying very low 
frequencies.” 

To obtain microwave amplification, one needs a capacitance that 
varies at microwave or higher frequencies. Such a time-varying capaci- 
tance can be obtained by applying a microwave voltage to the voltage- 
dependent capacitance of a p-n junction diode. Remarkably general gain 
and power relations can be derived for circuits of this type.’ 

Amplifying diode-frequency converters are not new. During World 
War II, welded-contact germanium diodes were found to give conver- 
sion gain when used in superheterodyne receivers of the type then under- 
going extensive development for radar use.* With the then-existing tech- 
nology, it was found that these diodes were not as good as point-contact 
silicon diodes in the over-all system noise figure that could be obtained. 
As a result of this observation, point-contact silicon diodes took over the 
entire radar field until recently, and the manufacture of welded-contact 
diodes was discontinued. 
= For such a calculation see van der Ziel, A. and Becking, A. G. J., Theory of 


Junction Diode and Junction Transistor Noise, Proc. I.R.E. 46, March 1958, pp. 
589-594. 
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These results, though entirely empirical, may very well have led to a 
belief that amplification by diodes was a freak effect and necessarily 
noisy. In the years following World War II, casual experiments on diode 
amplification were performed in many laboratories, but low noise was 
not found, nor was it intensively sought. The amplification effect was 
put to practical use in transmitting modulators, where noise would not 
be important.® 

At this point, the shot-noise theory of p-n junction frequency con- 
verters was developed. It provided one logically complete example of 
how a diode could amplify with arbitrarily low noise. It was recognized 
that the minority-carrier mechanism assumed in the analysis was prob- 
ably not as important, at high frequencies, as the depletion-layer ca- 
pacitance; at the same time, the understanding acquired from the shot- 
‘noise theory suggested that the depletion-layer capacitance would also 
be noiseless. 

Subsequent efforts to make improved nonlinear capacitance diodes® 
and to give them a fair circuit evaluation have been extremely well re- 
warded. A diffused-junction silicon diode has given 9 db of gain with a 
2-db noise figure in converting 460 mc to 9375 me.’ Also, such a diode 
has given 35 db of negative-resistance gain at 6000 me with a 3-db 
noise figure (for double sideband input).? 

In view of the large variety of possible frequency relations, to say 
nothing of the circuits involving combinations of diodes, it is fortunate 
that it can be deduced that an ideal nonlinear capacitor is noiseless in any 
circuit. By p-n junction techniques it is possible to make nonlinear ca- 
pacitors that are practically ideal, except for a series resistance. Work- 
ing theories of nonlinear capacitors will therefore be concerned with the 
noise from this series resistance.’ 


III. NONLINEAR RESISTORS 


Point-contact crystal rectifiers are in widespread practical use in 
microwave superheterodyne receivers. It has long been believed that the 
useful action of these devices results from nonlinear resistance and, as 
yet, there is no reason to doubt that this is true. 

As far as noise is concerned, there is no difference between the mathe- 
matical results for the p-n junction type of nonlinear resistance and the 
results that would have been obtained from earlier ideas of metal-semi- 
conductor contacts. But the older theory was not carried very far for the. 
calculation of noise figure. Therefore, it seems worthwhile to point out 
the importance of the local-oscillator waveform. 
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In the past, most analyses have been made on the assumption of 
short-circuited harmonics. This rather arbitrary postulation of a wave- 
form is not realizable in practice, but it does give rise to some simple 
relations between conversion loss and noise figure. Another case, open- 
circuited harmonics, will be found to have an entirely different relation 
between conversion loss and noise figure. Thus, the analysis of nonlinear 
resistors will show, by example, that no general relation between gain 
and noise figure is true. 

In contrast to the nonlinear resistor, a nonlinear capacitor gives low 
noise for any local-oscillator waveform. 


IV. p-n JUNCTION THEORY 


As shown in Fig. 1, there are two important methods of analyzing p-n 
junctions, each complementing the other. The pseudo-equilibrium 
method is an accurate means of calculating the capacitance-voltage 
characteristic of a p-n junction if one knows, from other considerations, 
that it is a nonlinear capacitor. However, this method gives no informa- 
tion about frequency dependence and noise. 

The kinetic method begins with an approximate separation of physical 
mechanisms, as discussed in Section V. This separation makes it rela- 
tively easy to calculate the frequency-dependent admittance and noise. 
The kinetic method can be applied not only to nonlinear resistors and 
nonlinear capacitors but also to the host of intermediate types of p-n 
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Fig. 1 — Structure of p-n junction theory involved in or related to the calcula- 
tion of noise figure. The encircled elements constitute the ‘‘classical’’ kinetic 
theory of p-n junctions. 
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junctions (including the so-called “simple” p-n junction between homo- 
geneous semi-infinite p and n regions). 

The usual mathematical approach in the kinetic method involves 
solving the diffusion equation, a differential equation for the average 
motion of carriers. This approach, which will be referred to as ‘‘classical’’ 
p-n junction theory, is satisfactory for calculating junction admittance. 
In calculating noise, it seems to be much easier first to calculate the noise 
due to individual carrier motions and then to sum them. This second 
procedure will be referred to as ‘the particle method,” since the indi- 
viduality of the carriers is preserved to a later stage in the calculation. 

The particle method gives relations between small-signal junction ad- 
mittances and the noise figure of frequency converters. The train of 
reasoning to be used is indicated by the arrows in Fig. 1. However, the 
connecting lines in Fig. 1 may be followed in any direction. For exam- 
ple, the small-signal admittances may be calculated from the diffusion 
equation or measured experimentally and used to calculate gain and 
noise figure. 


V. APPROXIMATE SEPARATION OF PHYSICAL MECHANISMS 


The equivalent circuit of Fig. 2 denotes the principal physical mecha- 
nisms of p-n junction action. The p-n junction is considered to be di- 
vided into essentially neutral p and n regions and an intervening space- 
charge region known as the depletion layer. 

The capacitance of the depletion layer may be analyzed by pseudo- 
equilibrium methods under the approximation that the depletion layer 
is free of carriers.° This capacitance varies with voltage, and amplifying 
frequency converters can be built that rely on it alone. It is reasonably 
certain that the depletion-layer capacitance will be very important in all 
high-frequency nonlinear-capacitance diodes. Nevertheless, we will ig- 
nore the depletion-layer capacitance entirely in the following theory of 
gain and shot noise, except for qualitative remarks. This theory will be 
developed around the motion of carriers across the depletion layer. 

The theory of noise thus obtained will illustrate the fact that at least 
one conceptual type of nonlinear capacitance amplifier is noiseless. It 
will then perhaps be easy to believe that the depletion-layer capacitance 
also should not be a source of noise. The analysis should be directly ap- 
plicable to practical nonlinear resistors, for these must have relatively 
small values of depletion-layer capacitance for satisfactory operation. 

The inescapable importance of the depletion-layer capacitance is con- 
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Fig. 2 — Equivalent circuit of p-n junction. In general, the nonlinear resistors 
and capacitors may be frequency-dependent. 
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Fig. 3 — Simplified equivalent circuit in which constant parasitic elements are 
assumed and hole and electron admittances are combined. 
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nected with the fact that there is series resistance Rs between the junc- 
tion and the metallic contacts, as indicated in Fig. 2. For many diodes, 
the series resistance is found to be reasonably constant as bias is varied, 
giving the equivalent circuit in Fig. 3. This observation is consistent with 
(but does not prove) the conjecture that the series resistance exhibits 
thermal noise. If this conjecture is true, the noise figure will depend upon 
the frequencies involved and upon the product RsCmin , where Cimin is 
the value of the depletion layer capacitance at the maximum reverse 
voltage. But the immediate problem is to calculate shot noise due to 
minority carriers (i.e., due to carriers that cross the space-charge region), 
and Rs and the depletion-layer capacitance will both be neglected. That 
is, the analysis will be carried out for the equivalent circuit of Fig. 4. 
It should be noted that this neglect corresponds to the usual first ap- 
proximation made in analyzing junction transistors.” 


VI. STATISTICAL DESCRIPTION OF MINORITY-CARRIER MOTION 


Each minority carrier —a hole in an n region or an electron in a p 
region — can be thought of as an individual. A hole may begin its exist- 
ence as a minority carrier by surmounting the barrier potential of the 
depletion layer and entering the n region, or by being spontaneously 
(thermally) generated in the n region. Its existence may end by recom- 
bination in the n region or by return to the p region. The situation for 
electrons is analogous. 

One need consider only the holes and electrons that cross the barrier. 
The life-histories of these carriers, which are the elementary events of 
the theory, fall into the three classes shown in Fig. 5. Their meandering 
paths indicate that the motion of a carrier is governed by statistical 
rules. These rules depend upon the detailed structure of the junction, 
but it is in keeping with classical p-n junction theory to assume that they 
do not depend upon the bias applied to the junction. 


~<— 
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Fig. 4 — Equivalent circuit of p-n junction with negligible parasitics. 


958 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 


Of the three classes of elementary events, the reverse carriers will, 
throughout the analysis, be the simplest to deal with because they are 
assumed to occur entirely at random. Holes thermally generated in the 
neutral region of a p-n junction move about according to statistical 
rules, and some of them come in contact with the depletion layer. The 
electric field in the depletion layer is in such a direction as to move these 
holes across to the p region (forward bias decreases this field but ordi- 
narily does not change its direction). Similarly, some of the electrons 
generated in the p region cross to the n region. These “saturation cur- 
rents” are represented by current generators 7;, and zs in Fig. 2. They 
are direct-current generators and do not enter physically in the ac ad- 
mittance of the junction, although their values can be introduced, by 
mathematical manipulation, into expressions for the ac admittance. 
But the saturation currents do produce shot noise. The sum of 7, and 
isn 18 the saturation current, which may be determined experimentally 
by measuring the current at a moderate reverse voltage. 

Of the many holes in the p region, a few will overcome the potential 
barrier of the space-charge region and be emitted into the n region. The 
rate of emission will be assumed to be a nonlinear function of the po- 
tential across the space-charge region. This function will be written 
R(v), where v is the deviation (in the forward direction) from the equi- 
librium potential. ‘Classical’? p-n junction theory, to be considered in 
Section VIII, involves assumptions equivalent to taking 
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Fig. 5 — The three classes of hole and electron motions (‘‘elementary events’’) : 
(1) reverse carrier from spontaneous generation; (2) forward carrier — emission 
followed by recombinations; (8) forward-and-back carrier — emission and return. 
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@(v) = constant x &” B= QG/kT, (1) 


where q is the electronic charge, k is Boltzman’s constant, and T’ is 
absolute temperature. The constant of proportionality is not specified. 
The conversion and noise analyses will be carried out with an arbitrary 
function R(v), to accommodate refinements in p-n junction theory that 
may specify an emission relation differing from (1). 

The probability of an emitted carrier not returning will be denoted 
Py. Carriers with such a fate lead to a nonlinear conductance in the 
equivalent circuit. The probability of an emitted carrier returning at 
time ¢’ after emission will be P(t’). Forward-and-back carriers result, 
in general, in a nonlinear conductance and a nonlinear capacitance whose 
values and frequency dependence can be calculated from P(¢’). Since an 
emitted carrier must either return or not return, 


coe ee i ” P(t) dt. (2) 


It is consistent with the usual approximations of p-n junction analysis 
to assume that Po is a constant and that P(t’) depends only upon ?’. 
These assumptions will permit the noise analysis to be carried out rather 
simply. However, the effect of high current densities upon Po and P(¢’) 
may be an important feature of point-contact diodes. 

All that has been said about the emission of holes from the p side 
applies to the emission of electrons from the n side. Of course, R(v), Po 
and P(t’) may be quite different for the electrons. In general, one must 
perform analyses for the holes and electrons and then add the currents. 
However, if (1) holds, ® for electrons is proportional to ® for holes, for 
all values of v. Then one can define suitable ®, Po and P(t’) to apply 
to holes and electrons taken together. To simplify the presentation, it 
will be assumed that this has been done. 

In assuming a certain probability for recombination of an emitted 
carrier, we are, in effect, assuming linear recombination. The assump- 
tion that the probability of return depends only on the time after emis- 
sion is equivalent to the assumption of a linear diffusion equation in- 
cluding, possibly, drift in a steady electric field. These are, of course, 
usual assumptions which are made in analyzing p-n junction devices. 
It is not necessarily assumed that the recombination occurs uniformly 
throughout the volume. Surface recombination or recombination at 
localized centers is consistent with the assumptions as long as the re- 
combination is linear. Also, we are not restricted to simple diffusion; 
drift in a steady electric field is permitted. 

The currents for the three types of carrier events are illustrated in 
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Fig. 6. The reverse carrier gives a single negative pulse with an area 
equal to one electronic charge; the forward carrier gives a positive pulse 
with an area of one electronic charge. The detailed shape of the pulses 
might differ from one to the next and, furthermore, the average shape 
of the forward pulses might be somewhat different from the average 
shape of the reverse pulses. However, it will be assumed in most of the 
analysis that the entire pulse is of such short duration that the exact 
shape is of no consequence at the frequencies of interest. The third type 
of event — forward emission followed by the return of the same car- 
rier — leads to a current whose integral is zero net charge. The forward 
pulse part of this single event should be similar to the forward pulse of a 
carrier that is emitted and does not return. Likewise, the reverse pulse 
part should be similar to the reverse pulses of the thermally generated 
carriers. However, the fact that the two pulses correspond to a single 
event is statistically important. 


REVERSE 


| \ FORWARD 


| \ FORWARD-AND-BACK 


TIME —> 


CURRENT —> 


Fig. 6 — Current pulses for the three types of minority carrier events. 
VII. SMALL-SIGNAL ADMITTANCE 


The small-signal admittance of the p-n junction will be calculated from 
®, Po and P(t’). The calculation introduces some relationships needed in 
the main analysis. To avoid the introduction of a large number of symbols, 
a time function and its Fourier coefficients or spectrum will be denoted 
by the same symbol, for example, z(t) and 7(w); P(w) is an exception 
with a special definition to be given in (10). 

In general, the current is equal to the difference between the inant 
of forward carriers and the current of reverse carriers. For instantaneous 
forward and reverse transits, this means 


i) = i, + gato] - 4 [ dalle — e)IP). (3) 


The spectrum of 7 may be found from 
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ila) = 5 a ide ™ dt. (4) 


This normalization of the spectrum will be adhered to in the calculation 
of noise. The small-signal admittance (also, the conversion matrix dis- 
cussed in Section X) is a ratio of current to voltage, so any consistent 
normalization may be used. Consider, then a voltage of the form 


v(t) = % + Re v(w)e™, (5) 


where v(w) is the complex amplitude of a vanishingly small ac voltage 
of angular frequency w. The expected solution is 


i(t) = % + Re i(w)e™, (6) 
i(w) = Y(w)v(w). (7) 


The problem is to find Y(w) and 7%. 
Because v(w) is vanishingly small, 


Rlv(£)] = R(vo) + A’ (vo) Re v(w)e**’. (8) 


Inserting this expression in (8), and performing the integration indicated 
in (4), one finds that 


Y(w) = qR'()P), (9) 

where 
Pa) 2 i o*" P(t!) dt (10) 

Also, 
to = —ts + qR(vo)Po (11) 


after (2) is used. 


VIII. CLASSICAL Pp-h JUNCTION ANALYSIS 


The analysis given thus far will be developed later into ways of re- 
lating conversion gain and noise to the small-signal admittances of a 
junction. Classical p-n junction analysis provides a way of predicting 
the small-signal admittances of postulated structures, although it does 
not give unique values for R(v), Po and P(t’). Instead, it gives the prod- 
ucts R(v)Po and R(v) P(t’), which suffice for the desired calculations. 

The basic assumption of the classical theory is that the hole concen- 
tration p: on the n-side of the depletion layer is given by” 


pi = pe (12) 


where py is the equilibrium value of p,. Equation (12) is a boundary 
condition for the diffusive flow of holes in the neutral n-region. The 
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generalized diffusion equation (including recombination and drift) is 


oP = —2 7 PY Dvn — pV-(pE). (13) 
T. ‘ 


The hole concentration p is a function of time and of position within 
the neutral n region; D, yw and 7 are diffusion constant, mobility and life- 
time for holes in the n region. The equilibrium hole concentration py 
will vary with position in the n region if the donor concentration is not 
uniform. For the formal analysis, 7 and the electric field E will be as- 
sumed to be independent of p and ¢, so that (13) is a linear partial differ- 
ential equation. 

The hole current density can be calculated from 


J = —qDVp + qupE. (14) 


If the voltage, v, is composed of a number of sinusoidal components’ 
the hole concentration will be composed of sinusoidal components. Be- 
cause (12) is nonlinear, the set of frequencies involved in 7, will not, in 
general, be the same as the set of frequencies in v. Even so, the relation 
between p; and J is linear and may conveniently be expressed in the form 


i(u) = AP Pe) (15) 


where A is the junction area and L, is the “effective ac diffusion length” 
for the structure in question. In other words, L. is simply a quantity 
that makes (6) and (15) give the right answer for 7(¢). This right answer 
is the current obtained by solving (13), applying (14) and integrating 
over the junction area; this procedure will be used in Section XII in the 
calculation of Z., for a particular nonlinear-capacitor structure. 

The small-signal hole admittance can be found by considering in (12) a 
voltage of the form given in (5). Then 


Bvo 
Vins Pea pe (16) 


The electron admittance is calculated similarly, usually with a quite 
different value of L, . 
A comparison with the statistical description can now be made using 
(9) and (16). One finds 
BADpye*”® 


Gi'(0) P(w) = =— Par, (17) 


or 
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ADpye*”® 


pe, (18) 


R(w)Pw) = 
since ®’ equals 6&, from (1), which is the form of ® that must be as- 
sumed to effect a comparison of the particle theory with the classical 
theory. Also, if (1) is assumed, (9) can be written 


Y(w) = gBG(v)P(w). (19) 


In the analysis of conversion and noise, it will be found that all im- 
portant quantities can be expressed in products of RP(w) or R’P(w). 
Classical p-n junction theory therefore gives a complete solution of the 
problem. Separately defined values of ® and P(w) are not needed be- 
cause a carrier that goes across and then comes back in an infinitesimal 
time might just as well not have been emitted in the first place. 

The noise analysis will be carried through for an arbitrary emission 
function, R(v), in case the classical theory is not a satisfactory one for 
some situations. 


IX. SHOT NOISE FOR dc BIAS 


The application of the above principles to the calculation of shot noise 
in de-biased p-n junctions has been published." For this simple case, 
many of the steps in the calculation combined so obviously that explicit 
mention of each step was not necessary. Here, a step-by-step derivation 
of the de results will be given as a model for more complicated cases. 

The calculation of shot noise will be based on the following general 
principle: Each elementary event delivers a certain amount of noise 
energy to the load. The noise power is the rate at which noise energy is 
delivered to the load. 

At a given terminal pair of a linear circuit, the noise can be repre- 
sented by a mean-square current generator (:”), since the noise power 
delivered to a load connected to the terminals can readily be calculated 
from such a generator, the load admittance and the output admittance 
of the terminals. Representation by a current generator seems particu- 
larly appropriate for shot noise, where the elementary events are repre- 
sented by current generators. 

In p-n junctions, the elementary events are the three classes of mi- 
nority-carrier histories: reverse, forward and forward-and-back. Associ- 
ated with each carrier event is a small-signal current generator J(t), as 
illustrated in Fig. 6, with Fourier transform J(w). 

Suppose that J(w) is observed for an infinitesimally small frequency 
band B. Then the transform back to the time domain of this selected 
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element of the spectrum is a current generator 

g(t) = i. dw5(w)e*’ + complex conjugate, (20) 
where the indicated integration is over the values of w that lie within the 


selected band. 
If one writes 


g(t) = A+ A*, (21) 
then §°(¢), which is proportional to the instantaneous input power, is 
S(t) = 2AA* + A? 4+ (A*)’, (22) 


Integration over total time (to obtain a quantity proportional to the 
energy) can be shown to drop out the latter terms, so that 


/ #(t) dt = 2 / AA* dt. (23) 


Now, 4AA* can be written as the double integral 


AA* = | do | du's(u)s*(u' ye (24) 
B B 
Integrating with respect to time gives 
[ sata = [dw | da’s(w)s*(o)2xa(0 — 0')] 
B B 


(25) 
aor [ des | 9() [°, 


so that 
[ #0 at = 89°B | 9(e) F. (26) 
If the carrier transits are instantaneous, a forward or reverse transit at 
time ¢ gives a current generator | 
S(t) = +q6(t — bt), (27) 


where the impulse function 6(¢) has negligible width in time, but its 
integral over time is unity. 
The spectrum of (27) is 


se ie (28) 


T 


ieee i] fee ie ds 
Qn — 00 
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Tor either a forward or a reverse transit, (26) becomes 
/ H(t) dt = 2q°B. (29) 
The development thus far could be used only to treat the nonlinear- 
resistor type of p-n junction. For the general p-n junction, forward-and- 


back events must be considered. A forward pulse at é , followed by a 
reverse pulse at f& + t’, gives a current generator 


S(t) = gi(t — to) — go(t — to — ¢), (30) 
Sw) = 5 lee — oH | (31) 


For this case, (26) gives 
/ S(t) dt = 2¢ B(2 — 2 cos wt). (32) 
The shot noise for each class of carrier event is found by multiplying 
the appropriate integral in (32) by the rate of occurrence of such carrier 
events. Thus, 


Ur ies = 2q°B = 2qBi. ) (33) 


@)torwara = 29° BR(v0)Po , (34) 


l 


 ortand aha heck 


2¢° BR(v0) i ° (2 — 2 cos wt’) P(t’) dt’ (35) 


2¢°BR(v)[P(w) + P(w)* — 2Pol. 
The total shot noise is the sum of (33), (84) and (86): 
(row = 2gB{is + gH(vo)[P(w) + P(w)* — Pol}. (36) 


For the important special case of classical p-n junction theory, (36) 
can be transformed through use of (11) and (19) into 


i \rotar = 4kTBG(w) — 2qioB, (37) 


where G(w) is the real part of Y(w). This result is easy to remember be- 
cause it has the form ‘‘thermal noise minus shot noise’. However, it is 
evident from the derivation that no subtraction of one noise from another 
is actually involved. 

A previous derivation of (87) was given under less general assump- 
tions,” and the equation was compared with experimental results for 
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p-n junctions.*:4 The good agreement with experiment suggests that 
it is worthwhile to analyze the noise of a p-n junction frequency con- 
‘verter. 


X. SIGNAL TRANSMISSION IN LINEAR FREQUENCY CONVERTERS 


In a linear frequency converter, the output signal voltage and current 
are linearly related to the input signal voltage and current. . 

Of course, frequency conversion by a nonlinear device requires a finite 
local-oscillator voltage. The local-oscillator interaction with the device 
is quite nonlinear. Since the local-oscillator drive is regarded as a power 
supply, not an intelligence-bearing signal, a linear frequency converter 
can be made with a nonlinear device. Linear behavior is obtained when 
the signal voltages are assumed to be arbitrarily small. 

The analysis of signal transmission in the linear frequency converter 
will be made on the assumption that the local-oscillator waveform is 
periodic, with a fundamental frequency b/27. This assumption is not 
trivial, for nonlinear capacitors can be tuned to oscillate at frequencies 
incommensurate with the driving-oscillator frequency, as well as at sub- 
harmonics of this frequency. The formal results could be generalized to 
cover the nonperiodic case, but most practical situations involve periodic 
waveforms. 

The frequency components of the emission rate ®(v) must be related 
to the frequency components of the voltage, v(¢), applied to the junction. 
The voltage will be given in the form 


v(t) = viol) + rs1c(4), (88) 


where vzo(t) is the large, periodic local-oscillator voltage (plus any de 
voltage) and vgic¢(t), which will be regarded as arbitrarily small, is the 
sum of all the signal voltages. Here the word “signal” includes the input 
and output signals and the image-frequency signals, all of which are 
linearly related. The method of calculating these linear relationships 
will next be given. The local-oscillator voltage, whatever its waveform, 
may be written 


Yro(t) = » Uz,0(m) ene (39) 


where m takes on all positive and negative integer values including zero. 
Most frequency-conversion problems can be treated by proper identi- 
fication of s in the following representation for vgic : 


Usra(t) = Re >, v(mb + set! (40) 
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For example, s/27 could be the intermediate frequency (output) of a 
superheterodyne receiver (igs. 7 and 8), or the input frequency of a 
transmitting modulator (Fig. 9). All of these circuits and many others 
can be obtained by suitable connections to the multiterminal “black 
box” indicated in Fig. 10. Negative frequencies are included in the set 
(mb + s)/2z; they are the “lower sidebands’’. For the sake of obtaining 
noise-figure expressions, it will be assumed that the input and output 
are at two distinct frequencies (mb + s)/2m and (mob + s)/2z, as indi- 
cated in Fig. 11. In case the input and output are at the same frequency, 
as in a negative-resistance amplifier, the noise-figure expressions will 
not be applicable, but (65) will give the shot-noise current at the fre- 
quency of interest. 

Any information-bearing signal will contain a number of frequencies 
(in general, a continuous spectrum). The linear nature of the small-signal 
problem enables one to analyze for each frequency separately. 
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Fig. 7 — Upper-sideband superheterodyne receiver (down-converter). 
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Fig. 8 — Lower-sideband superheterodyne receiver (down-converter). 
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Since vsjg is arbitrarily small, one can write 
Rlv(t)| = R[vro()] + @’[vro(t) ]osre - (41) 


The de and large-signal de components of ® are given by the Fourier 
expansion of ®[vyo(t)]. The Fourier expansion of &’[vpo(t)] 1s required for 
the signal transmission, that is, for calculating the conversion matrix. 
Both expansions are necessary to calculate the noise figure. 

Assume that the required Fourier coefficients have been calculated, 
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Fig. 9 — Double-sideband transmitting modulator (up-converter). 
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Fig. 10 — “‘Black box’”’ representation of a linear frequency converter. 
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so that one can write 


Rlvpo(t)] = Dy K(mb)e””, (42) 


and 


R [oro (t)] = » (mb) e"™. (43) 


Then, one can say that ® is made up of a large periodic part with Fourier 
components ®(mb) and a vanishingly-small signal part with Fourier 
components . 


A(mb + s) = DS R'[(m — n)blv(ndb + 8). (44) 


The complex amplitudes of the alternating currents, both large and 
small, can be calculated by multiplying ¢P(w). First, the large-signal 
current will be called 7,5, with Fourier components 


ino(mb) = gP(mb)R(mb). (45) 
It should be noted that 7, (0) differs from the net direct current, zo, by 
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Fig. 11 — Linear frequency converter used as an active four-pole network. 
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the saturation current, 7; : 
izo(0) = to +2; . (46) 


Second, a complete specification of the small-signal transmission of 
the frequency converter is given by 


isic(t) = Re >, i(mb + s)e%™t* (47) 
with 
i(mb + s) = gP(mb + 8) >> @[(m — n)b]v(nb + 3). (48) 


Equation (48) can be written in the form 


i(mb + s) = >> Y(m, n; s)v(nb + 3), (49) 


where 
Y(m, n; s) = qP(mb + s)R’[(m — n)b]. (50) 
The array Y(m, n; s) is called the conversion matrix. For brevity, 
Y(m,n;s) = Yuan. (51) 


In operation, the linear frequency converter will be terminated with 
admittances Y, at each of the “terminals” mb + s. A system of current 
generators J, at each of these terminals will produce currents and volt- 
ages that obey the relationship 


Therefore, 

Im = >» (Yinn + VimOmn) Un ) (53) 
where 6mn is 1 if m equals n and bm, isO if m does not equal n. One can 
also write 


ae oe Ore ; (54) 


where 


in matrix notation. 
The effect at terminal m of a current generator J, at terminal n can 
be duplicated by a fictitious current generator nmxJn placed at terminal 
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m, where 


_ Amn : 


Nmn ae 





(56) 


does not depend on Y,, (Thévenin-Norton theorem). This property 
makes 7nn a desirable quantity to use in the analysis so that a necessary 
characteristic of the noise figure — that it be independent of the load 
admittance — will appear quickly and naturally. 


XI. NOISE IN THE GENERAL p-n JUNCTION FREQUENCY CONVERTER 


Suppose that the appropriate value of J(w) for a carrier event is ap- 
plied to each terminal of the linear frequency converter. Then the equiva- 
lent current generator at the output m2b + s will be 


» Nngnd (nb oe s). , : | (57) 


This sum can be substituted for J(w) in (26). For either a forward or a 
reverse transit, (26) and (57) give 


[ #0 at = 228 D, mentyel 68) 
The development thus far could be used to treat the nonlinear-resistor 
type of p-n junction. For the general p-n junction, forward and back 


events must also be considered. A forward pulse at f followed by a Te- 
verse pulse at fo + ?¢’ gives 3 ; 


( S(t, t) dt = 
2¢°B pe nace = go iinbtsye! _ gitn'bts)e” (59) 


BS gin —nyoey 


In all applications of (59), the weighted sum over all possible values 
of ¢’ will be required: 


i dt'P(t’) | H(t’, t) dt = 
0 
2¢7°B DL nmsntngn'e” —”[P(nb + 8) (60) 


+ P*(n'b + s) — P(nbd — n’'b) — Pol. 
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11.1 Reverse Carriers 


The thermally generated reverse carriers occur at random but at a 
steady average rate 7,/¢. Multiplying (58) by this rate gives 


ip semen = 2qBi, > | mon )?, (61) 
since the terms in (58) for which n does not equal n’ are oscillatory and 
give no contribution to the noise power when averaged for a steady rate. 

Equation (61) may be interpreted as simple shot noise applied to each 
terminal and independently converted to the output (except the term 
for n = m,, which represents the direct shot noise at the output). 

11.2 Forward Carriers 

The rate of emission [PoR(té)] of forward carriers varies periodically, 
so that an average of (58) over the local-oscillator cycle is required: 

lb 

Mt ieee = ae | i dtoPoR (to) [27°B 2s Nmantman'@n” il . (62) 
The Fourier expansion, (42), of R(t) may be inserted in (62). One then 
finds 

@Ytorwara = 29° BPo 2 A(nb — 1'b)nmenttman’ - (63) 


11.3 Forward-and-Back Carriers 


In similar fashion, one can average (60) over the local-oscillator cycle 
to obtain the mean-square output current generator for all classes of 
forward-and-return carriers: 


GO erara inane = 2¢°B 2: R(nb roe 1b) nmantmon'LE (nb ote s) 
me (64) 
+ P*(n’b + s) — P(nb — n’b) — Pol. 


1.4 Total Shot Noise . 
The sum of (61), (63) and (64) is 


@ total = 2qB (: > mon |? 


+ q DS (nb — n'b)nmentingn'(P(nb + 8) + P¥(n'b + 8) (65) 


— P(nb — n’on) 
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11.5 Image Conversion Noise 4 


The frequency converter is supposed to be terminated at the frequencies 
(mb + s)/2zx with specified admittances which may or may not generate 
noise. What noise is generated will be transmitted to the output accord- 
ing to the coefficients nom - 

The input and output terminations mb + s and mb + ¢ are in a class 
apart, because noise generated in these terminations is chargeable to the 
transducers that precede or follow the frequency converter. The other 
terminations mb + s, with m #~ m # m, will be called images. The 
frequency converter (but not the diode) must be held responsible for 
output noise originating in the image terminations. 

The properties of the image terminations as noise generators depend 
upon the physical nature of the terminations and are not uniquely de- 
termined by their impedances. For example, the conductance G,, for 
each of the image terminations mb + s might consist of a resistor at 
temperature T',, . Then the image termination noise at the output would 


be 
(Cg eee terminations = 4kB Pay Lan | Nmom i (66) 


where the symbol >.’ indicates that terms inm = m, or m = m are not 
included in the summation. The terminations may be devices that are 
not in thermal equilibrium. If several diodes are used in combination, 
so that they terminate each other at some image frequencies, the noise 
generated in the mutual terminations can be much greater or much less 
than thermal noise. 


11.6 Noise Figure 


The noise output of a linear transducer must be judged in relation to 
the signal output for a given input signal. As a standard value for the 
input signal, it is conventional to use thermal noise corresponding to an 
absolute temperature of 290°K, which will be denoted 7; . The thermal 
noise current is 7/4k7.G,B, where G, is the source conductance, and an 
rms current of this value will be the standard input signal. 

The noise figure F can be defined by 


output noise power due to transducer 
—-l= : . (67) 
output signal power from standard input signal 





The noise figure of the input stage determines receiver sensitivity if the 
gain of the transducer is sufficiently high. For the frequency converter, 


shot noise at output + image conversion noise at output 


So ee, “8 
output signal power from the standard input signal (68) 


F-1= 
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In place of power, we may use in each case the equivalent mean-square 
current generator at the output, which for the standard signal is 
GO auanea = 4kT,BG, | Nmgmy, le (69) 


Tor convenience, one may define 


poy = eee, = Ame (70) 


mgm, Moms 


Then the total noise figure may. be written 


F-1= yr RIG. Le {ad a R(nb SF: n'b) untn’*[P (nb ze s) 
+ P*(n'b + s) — P(nb — n’b)] + 4, a | on = (71) 


The last term of this equation represents image conversion noise. It 
is an important practical problem to design the circuit to minimize image 
conversion noise. But, in principle, this can always be made to vanish 
by refrigerating the terminations, so the subsequent theoretical discus- 
sion will consider only the shot noise. 


XII. NONLINEAR CAPACITORS 


A nonlinear-capacitor type of p-n junction is one in which all emitted 
carriers are collected within a short time after emission. If all emitted 
carriers are collected, Po is zero, since it is the probability of a carrier 
not being collected. Hence, there are no forward-only carriers. More- 
over, the reverse current 7, of the thermally generated carriers must be 
zero. One may show this by considering the diode in equilibrium, when 
the net direct current must be zero; if there are no forward-only carriers, 
there must be no reverse-only carriers. 

Thus, the only shot noise is that due to forward-and-back carriers, 
governed by P(t’). The exact form of P(t’) is not too important. The 
simple rectangular function shown in Fig. 12 will serve to illustrate the 
main result. The maximum storage time, A, will be allowed to become 
very small compared to 1/w, to approach the case of an ideal nonlinear 
capacitor. The function P(w) is, according to (10), 


A 
Pw) =1-[ edt! = bjoh +4 (oA) Hee. (72) 


SHOT NOISE IN p-l. JUNCTION FREQUENCY CONVERTERS 975 


The shot noise due to forward-and-back carriers is obtained when this 
P(w) is inserted in (71). Each term contains a factor 


P(nb + s) + P*(n’b + 8) P(nb — n’b) = 
1 i(nb + s)\A — 4 j(n’b + s)A — 4G(nb — n'd)A 


se . (Gibbs) E's)? = Gb = ae) OS) 


= (nb + 9)(n'b $5) + + 


The first-order terms in A cancel. As A goes to zero, 1.e., as the fre- 
quency response of the diode is increased, the shot noise goes 6 zero as A’, 


P(t’) 





ola be 


Fig. 12 — Example of a return probability distribution leading to nonlinear 
capacitor action. 


The capacitance, for a given voltage, 1s proportional to A. To approach 
an ideal nonlinear capacitor, let A approach zero. At the same time, the 
capacitance waveform can be kept essentially unchanged if the emission 
rate is increased (by appropriate adjustment of the local-oscillator volt- 
age). The shot noise then goes to zero with A and we get this result: A 
frequency converter using an tdeal nonlinear capacitor has no shot noise 
output regardless of the local-oscillator waveform, the source and load ad- 
mittances, and the image terminations. 

An interesting qualitative interpretation* of this result is that, as A 
gets smaller, an increasing number of carrier events are required to ob- 
tain a given capacitance waveform. A smoother current results from the 
statistical effect of having a large number of elementary events. 





* Suggested by D. Leenov. 
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If the nonlinear capacitance does not arise from the minority-carrier 
storage mechanism that has been postulated, but is wholly or partly due 
to depletion-layer capacitance, there is even less reason to expect noise. 
The depletion-layer capacitance involves widening and narrowing of the 
space-charge region. It requires a relatively small motion of majority 
carriers instead of their emission all the way across the junction, or 
of any subsequent existence as minority carriers. The frequency re- 
response of C'r should therefore be better than that of minority-carrier 
storage capacitance. Also, the elementary events are likely to be repre- 
sentable approximately as forward-and-back current generators, with 
the area under each pulse being much less than gq, since the carriers do 
not go all the way across the junction. 

The problem of finite frequency response (A ¥ 0) can be studied by 
putting the expression given in (73) in (71), to get the leading term in 
the shot noise: 


2 
F—-1=—£2_ ¥ aa(nb — n'b)(nb + s)(n'b +8). (74) 
6kT Gs an! 

It is evident in (74) that F — 1 goes to zero with A if AR(nb — n’b) is 
held constant. But, for a finite value of A, the shot noise could be very 
large in the case of significant conversion from high-order harmonic 
images. Hence, a circuit design consideration is to make yu, small for all 
values of n such that nb is greater than 1/A. Short-circuiting of the 
harmonic images is one way of accomplishing this objective. 

Apart from finite frequency response, actual diodes will fall short of 
ideal nonlinear-capacitor action because of the finite lifetime of minority 
carriers. Some minority carriers will recombine before they have a 
chance to return to the junction. The effect of this recombination is 
equivalent to that of a nonlinear resistor in parallel with a nonlinear 
capacitor having infinite minority-carrier lifetime. As will be shown, 
low-noise frequency conversion is possible with nonlinear resistors when 
the local-oscillator current is sharply pulsed. Therefore, low-noise fre- 
quency conversion is possible with a nonlinear capacitor suffering from 
recombination, but the local-oscillator waveform can no longer be arbi- 
trary, and the conversion gain will be reduced. 


121 Nonlinear Capacitor Structures 


Four semiconductor structures shown in Fig. 13 exhibit nonlinear ca- 
pacitance, for reasons to be discussed. The first of these can be made 
from a transistor, and thus is a readily available specimen for relatively 
low-frequency (about 10 mc) experiments with nonlinear capacitors. 
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However, the great promise of the nonlinear capacitor is the possi- 
bility that a diode structure can be made to work at higher frequencies 
than transistors. Transistors, as will be explained below, are not one- 
dimensional structures, a fact that aggravates the series-resistance prob- 
lem. The pt-n-nt and p-i-n and the graded p-n junction can be truly 
one-dimensional structures, as long as skin-effect is not important. Of 


iPLANE OF 
|SYMMETRY 














WwW > (Npt)-(Na-) 
de x 
(b) 


(Np+)-(N a-) 











(N p+)-(N a-) 





Fig. 13 — Nonlinear capacitor structures: (a) pt-n-pt; (b) pt-n-nt; (¢) pt-i-n*; 
(d) graded p-n junction. 
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these, the graded p-n junction seems to be the easiest structure to fabri- 
cate by solid-state diffusion of impurities into semiconductors. 


122 p'-n-p* Structure 


The p*-n-p* structure in Fig. 13(a) can be made by connecting the 
emitter and collector leads of an alloy-junction transistor. A symmetri- 
cal alloy transistor (one with emitter and collector junctions of equal size) 
is theoretically preferred for maximum nonlinear-capacitor action. The 
pains taken by the transistor manufacturer to prevent emitter-collector 
short circuits are, of course, completely wasted in this application. 

The emitter and collector junctions thus become, in effect, one junc- 
tion which almost completely surrounds the n region. Because of the 
high concentration of holes in the heavily doped p* region, compared to 
the concentration of electrons in the n region, nearly all of the emitted 
carriers are holes entering the n region, rather than electrons going into 
the p* regions. Holes going into the n region may recombine and become 
forward-only events. But if the base width 2w is narrow enough, most 
of the holes will escape recombination, either by returning to the junc- 
tion from which they were emitted or by being collected at the opposite 
junction. In either case, such holes become forward-and-return events 
with respect to the Junction, consisting of emitter and collector junctions 
taken together. 

The n region is assumed to be of uniform resistivity and thus to be 
field-free except for the space-charge regions at the junction. Hence, the 
motion of the emitted holes is simple diffusion. The diffusion equation 
(13) then reduces to 


(jo + 1/7)po = DV’ pa . (75) 


If one lets x equal zero at the plane of symmetry indicated in Fig. 
13(a), the solution of (75) appropriate to this symmetry is 


Do © cosh av/(jo + 1/r)/D. (76) 
Atzx = uv, pw equals pr, , SO 
tates cosh x/(jo + 1/7)/D (77) 


” cosh wV/ (jo + 1/7)/D 


The current density is given by gDVp evaluated at x = w. Then, by 
comparison with (15), one finds that 


1/Ly = W(Gow + 1/7)/D tanh wV/(jo + 1/7)/D. (78) 
According to (18), 1/Z, is proportional to P(w). For comparison with 
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Fig. 14 — One-dimensional versus two-dimensional bipolar (hole and electron) 
flow patterns in Junction devices. 


(58), recombination may be neglected by setting 1/7 ~ 0, and (64) can 
then be expanded on the assumption that w is small (i.e., that the fre- 
quency response is high relative to w). One finds 


Peo) «p= Ape et (e) | (79) 
Leo Ww D 3\ D 

Hence, the leading terms are proportional to those of (72), with w’/D 
playing the role of the time A in which (nearly) all of the emitted car- 
riers return. 

The postulated condition of neutrality of the n region implies that, 
when holes are emited into the n region, an essentially equal number of 
electrons must enter the n region from the base contact. A voltage drop 
is required to move these carriers, and this may be crudely described as 
a series resistance. The thinner the base layer is made for the sake of 
high frequency response, the higher the series resistance. This dilemma 
is well known in transistor design. As shown in Fig. 14, the electrons move 
in a direction perpendicular to the average motion of the holes; hence, 
two directions in space are involved. Thus, any one-dimensional analysis 
of the transistor is inherently an oversimplification, although sometimes 
a very useful approximation. 


12.3 p'-n-n* Structure 


As an introduction to a structure that is truly one-dimensional, note 
that the plane of symmetry in Fig. 13(a) could be replaced by a surface 
that has the property of reflecting holes. A heavily doped n* region would 
have this property, and this suggests the p’-n-n™ structure shown in 
Fig. 13(b). Equation (78) should apply to this structure as well as to 
that of Fig. 13(a). 
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For each element of p*-n junction area, there is a corresponding area 
of n-n* junction that reflects emitted holes and also provides the elec- 
trons to neutralize the holes. Thus, the area of this structure can be in- 
creased in two dimensions without affecting its performance, except for 
a proportionate lowering of impedance level. This is a characteristic of 
a one-dimensional structure. 

The depletion-layer etching technique has been used to make very 
thin germanium p*-n-x structures, where x is supposed to be a region of 
high recombination.” The unsolved problem in making this structure is 
that of obtaining a satisfactory x region for nonlinear resistance action 
at high frequencies. No such problem exists in making a p*-n-n* non- 
linear capacitor, since a well-understood n* region is required in place 
of the little-understood x region. 


12.4 p'-i-n* Structure 


The p‘-i-n® diode is another one-dimensional structure that has a 
carrier-storage behavior described qualitatively by (78) because the 
basic carrier-storage action is quite similar to that of the p‘-n-n* struc- 
ture. Thus, holes emitted from the p* region are reflected at the n* 
junction. In addition, electrons emitted from the n* junction are reflected 
at the p* junction. 

An accurate analysis of the p’-i-n™ structure requires consideration of 
the simultaneous motion of holes and electrons, and would represent an 
advance over present p-n junction theory. In the meantime, the qualita- 
tive considerations mentioned above indicate that the p*-i-n* structure 
should give nonlinear-capacitance action up to frequencies determined 
by the width of the i region. At much higher frequencies, the structure 
will just be a resistance whose value depends upon the density of car- 
riers in the i region, which density is proportional to the forward direct 
current. 


12.5 Graded p-n Junction 


By drawing a smooth curve through the concentration gradients for 
either the p*-n-n* or the p°-i-n® structures, one obtains the graded 
p-n junction shown in Fig. 13(d). 

The graded junction has features that recommend it as a nonlinear- 
capacitor structure.® The electric field associated with the impurity 
gradient tends to reflect emitted carriers — the steeper the gradient, the 
larger the field and the higher the frequency response of the capacitance. 
Analysis of the frequency response, based on the approximate kinetic 
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theory of p-n junctions, can apply (if at all) only to an intermediate 
region of bias where emission is significant but not so copious that the 
electric field is altered by the forward current. 

The frequency response of the capacitance will depend on bias, being 
very high for reverse voltages and deteriorating gradually as the voltage 
approaches and enters the forward region. However, microwave experi- 
ments suggest that the series resistance will generally be a much more 
important source of loss and noise than any frequency limitations of 
the nonlinear capacitance. A steep impurity gradient makes for a series 
resistance that is low both in absolute value and in relation to the ca- 
pacitance. However, the capacitance per unit area increases as the 
gradient increases. 


XIII. NONLINEAR RESISTOR NOISE 


The nonlinear resistor type of p-n junction has the property that none 
of the emitted carriers return, except possibly some carriers that return 
in a time negligibly short in comparison to the frequencies under con- 
sideration. Within the frequency range of nonlinear resistor action, the 
admittance due to emitted carriers is real and independent of frequency. 

In spite of the absence of return pulses to cancel the shot current of 
emission, the shot noise delivered to the load can be made very small if 
the local-oscillator current flows in short pulses. The plausibility of this 
result will be argued by representing the p-n junction as a noiseless ad- 
mittance in parallel with a generator of shot current pulses. The shot 
current and the time variation of the admittance are governed by the 
local-oscillator drive. For the nonlinear resistor, the admittance is a pure 
conductance and the shot current to be considered is that of emitted 
carriers that do not return. 

In a frequency converter, the nonlinear resistor is connected to a linear 
passive network which includes the terminations at all the signal fre- 
quencies. If practically all of the shot current flows at a time when the 
junction conductance is very large compared to the network admittance, 
the shot current will be short-circuited through the junction conductance 
and little noise will be delivered to the network. 

Another way of describing the situation is to consider that shot noise 
results because one cannot predict with certainty that a carrier will be 
emitted in any given short time interval. When a local-oscillator voltage 
is applied to a junction, it is known that the sum of the carrier currents, 
in spite of their individual unpredictability, is very nearly equal to a 
definite macroscopic current waveform that can be determined from the 
junction characteristics. When this macroscopic current consists of short 
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Fig. 15 — Pulsed local-oscillator current waveform giving partial reduction 
of shot noise. 


sharp pulses, it is possible to predict that many carriers will be emitted 
during the pulses and practically none during other times. Unpredicta- 
bility, hence shot noise, is greatly reduced for a pulsed current waveform 
like the one shown in Fig. 15. 

The principle of using pulsed local-oscillator current can be applied 
in the design of nonlinear-resistance frequency converters. In a simple 
design, for example, it would be preferable to use shunt conductance 
rather than series resistance to attenuate the power from the local oscil- 
lator. More sophisticated mixers may use filters and additional diodes to 
shape the local-oscillator waveform. 

The image terminations chosen below to illustrate the waveform effects 
should not be regarded as the optimum conditions for practical use. For 
example, the main image frequency might well be open-circuited to 
minimize conversion of thermal noise from the series resistance, while 
short-circuiting of this image will be assumed for mathematical sim- 
plicity (in the case of short-circuited harmonics). 


13.1 Short-Circutted Harmonics 


As an example of a circuit permitting low noise and good signal trans- 
mission, the mathematically simple case of short-circuited harmonics 
will be considered. Another way of defining this case is to say that the 
local-oscillator voltage waveform is purely sinusoidal (plus a de com- 
ponent, vo). Thus, 


v(t) = vo + vp cos bt. (80) 
The exponential emission function, (1), will be assumed, so that 
qPyR(v) = ie” (81) 


Oe a aaa (82) 
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The constant 7, is necessarily equal in magnitude to the thermally gen- 
erated reverse current, so that no net external current flows when » 
equals zero. From (81), 


gPyR' (v) = Bie” (83) 
which, with (80), gives 


gPyW' (v) = Bine??%eP0°**, (84) 


The Fourier expansion of (84) involves modified Bessel functions of the 
first kind,”® denoted by J, . Thus, 


gP&' (nb) = Bise”"T, (Br). (85) 
From (50) and (85), one finds that the conversion matrix is 
Y(m, n; s) = Bise”’’Lm—n(Bvs). (86) 


To take a specific example, conversion between two frequencies dif- 
fering by b/2z will be considered [such as (b + s)/2m and s/2z]. All 
other signal frequencies (images) will be assumed to be short-circuited. 
Then the conversion matrix reduces to 








| Monn) | = a6. + io YI, (87) 
where 
L 1(Bv») 
= 8 
Io(Bvs) ee) 
and 7%, the direct current, is introduced in view of the relation 
io = ise°To(Bv,) — 1], (89) 


which may be obtained from (11). 

For minimum conversion loss, the four-terminal network represented 
by (87) should be used with source and load admittances that simultane- 
ously match the input and output admittances of the network. These 
admittances are, of course, real for a nonlinear resistor and are found, by 
standard methods,” to be 


Yin = You = G. = Gr = Bice”’To(Bu)/1 — 4. (90) 


Since the images are short-circuited, all the u,’s defined in (70) are 
zero except 


Mm, = 1 
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and 


1 Le 

ee Vee (91) 
y 

For the noise figure, algebraic manipulation that will not be detailed 

here gives 








7 u+oé 
F-l1= 92 
su we’ 2) 
where 
u=V1l-y¥ (93) 
and 
a 
= 10 + Vs ° (94) 


Equation (90) gives the dependence of Ff, upon the local-oscillator 
drive (represented by uw) and the direct current, represented by 6. The 
quantity @ defined in (94) is the ratio of the thermally generated current 
to the current carried by emitted carriers. 

For any value of 6, there is a certain value of wu, 


w= S/P+6—- 6, (95) 


that gives a minimum noise figure 
(96) 


As 6 approaches zero, Fimin approaches unity. At the same time, the opti- 
mum value of « approaches zero. The optimum local-oscillator voltage 
goes to infinity and the conversion loss approaches unity (that is, no 
loss at all). This result shows that low noise figure and minimum con- 
version loss are not incompatible. 

For most practical microwave diodes @ is quite small, since 7, may be a 
tenth of a microampere and the direct current 7 is of the order of a milli- 
ampere. An approximate formula is therefore relevant: 


For 6 = 10°, Finin = 1.02 = 0.1 db. Thus, the observed thermally gen- 
erated currents would not produce much noise and probably are not 
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responsible for much of the observed noise figure of crystals, except pos- 
sibly at elevated temperatures. 

This analysis has considered only the shot noise of the emitted car- 
riers. In actual diodes, the series resistance and the depletion-layer ca- 
pacitance impair the noise figure. The depletion-layer capacitance should 
be essentially noiseless, but, in combination with FR, , it sets bounds on 
the usable impedance level of nonlinear resistance of the emitted car- 
riers. With the impedance level thus established, R, causes a definite 
signal loss and contributes thermal noise. Furthermore, R, interferes 
with short-circuiting the local-oscillator harmonics or otherwise obtain- 
ing the pulsed local-oscillator current waveform essential for low shot 
noise. 


13.2 Open-Circuited Harmonics 


When the harmonics of the local oscillator are open-circuited, the 
local-oscillator current is sinusoidal (in addition to a direct current). It 
will be shown that this waveform is unfavorable for noise, although it 
permits a low conversion loss. 

The emission rate and the local-oscillator current are simply related: 


gP R(t) = tro (t) te = (ao a is) (1 Sie rie cos st), (98) 


where 
0-2 3.1; (99) 


The reason for introducing the quantity & will be apparent later. The 
Fourier analysis of PoR(t) is obtained by inspection of (98): 


gPoR(nb) =a +7, ifn =0 


= (Zo ai 7) ‘ a= 
= 0 otherwise. 


Since an exponential emission rate is being considered, ®’ equals BR 
and, from (50), 


Y(im,n;8s) = Bo +i) ifm=n 


1+e 


= 0 otherwise. 


ifm=n+t1 (101) 
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The admittance representation of a frequency converter lends itself 
readily to consideration of cases where the image frequencies are short- 
circuited, since the elements of Y(m, n; s) corresponding to the images 
may then simply be omitted. One finds, however, that, with open-cir- 
cuited harmonics and short-circuited images, the conversion loss. is at 
least 3 db, quite apart from the further losses due to the parasitics. As 
an absolute theoretical limit, this conversion loss appears to be ex- 
cessive. In seeking an escape from this limit, one naturally wonders if 
open-circuited images might be appropriate for open-circuited harmonics. 

Indeed, open-circuited images permit unity conversion gain. To deal 
with open-circuited images, we may invert the infinite-dimensional 
matrix || ¥(m, n; s) || to obtain the conversion matrix in impedance 
form: 


(—pih mite 
Bo + 4)1 — 





Z(m,n; s) = (102) 


The images can now be open-circuited by considering only the 2 x 2 
impedance matrix pertaining to the signals. This matrix may be inverted 
to give the admittance matrix for open-circuited images: 


|| Y || = Bio ++ 1s) 





reo 


Tor the conversion between two frequencies differing by b/27, with all 
images open-circuited, one has, in analogy with the short-circuit case, 


Yin = Your = Gir = Gs = Bo tts)V1 — &. (104) 
The noise figure is found to be 


_ 7 1402-2) 
pe T. 21 — 2)(2 — 2)’ pe) 


where 
2=V1—#. (106) 


For any value of z, the lowest noise figure is obtained when 6 equals 0, 
that is, when the thermally generated current is negligible. For 6 = 0, 
the minimum noise figure is 3.25 = 5.1 db; this occurs when z equals 
0.45 and the conversion loss is 4.2 db. Evidently the noise performance 
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with open-circuited harmonics is inferior to that obtainable with short- 
circuited harmonics. 


XIV. CONCLUSIONS 


The nonlinear capacitance of a p-n junction can be used in amplifiers 
and amplifying frequency converters. To the extent that the junction 
exhibits pure capacitance, it will be noiseless in any of these applications. 
Suitably designed p-n junctions are remarkably good nonlinear capaci- 
tors at microwave frequencies, and preliminary experiments indicate 
that good noise figures can be obtained. 

A pulsed local-oscillator current is required for low-noise frequency 
conversion with nonlinear resistance p-n junctions. This requirement is 
a logical consequence not only of p-n junction theory, but also of earlier 
metal-semiconductor rectification theories that assumed emission of 
carriers across a barrier. 
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Gain and Noise Figure of a 
Variable-Capacitance Up-Converter™ 


By D. LEENOV 
(Manuscript received March 31, 1958) 


The performance of a p-n junction nonlinear-capacitance diode as a 
low-noise, amplifying frequency converter is analyzed for the case in which 
the output signal frequency 1s many times greater than the input signal 
frequency. The diode is represented by an equivalent circuit consisting of a 
time varying capacitance and a constant series resistance. Formulae for the 
maximum available gain and the noise figure of the circuit are obtained, 
and representative numerical values are given for the noise figures of sys- 
tems incorporating such diodes as preamplifiers. 


I. INTRODUCTION 


Since microwave receivers frequently must operate at a very low in- 
put signal level it is important to obtain the smallest possible noise 
figure. One way of accomplishing this is to use a low-noise amplifier at 
the input of the receiver. In this article the role of a nonlinear-capaci- 
tance p-n junction diode in performing such a function is analyzed. 

The performance of a variable capacitance as a frequency converter 
has been extensively discussed in the literature.’” Theory indicates 
that when the variable capacitance is used for conversion between a low 
frequency and an upper sideband (a noninverting frequency), the maxi- 
mum available gain is fo/fi , fi: being the input frequency and fo being 
the output frequency. Hence in up-conversion (modulation) a power 
gain results, with the power added to the input signal being supplied — 
by the beat oscillator. In down-conversion a power loss results. (These 
statements apply when no signal power is transferred at the lower side- 
band or at any of the harmonic images). The situation is different when 
one of the two frequencies involved is a lower sideband (an inverting fre- 
quency). A negative resistance may then appear at both sets of terminals, 


* Supported in part by the U.S. Army Signal Corps under contract DA 36-039 
sc-73224. 
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resulting in regenerative gain or stable oscillations.* Large gains may be 
obtained when going either up or down in frequency. The present paper, 
however, is restricted to a consideration of upper sideband frequency 
conversion. Analysis of lower sideband operation is now in progress. 

In this article the frequency conversion performance of a nonlinear 
capacitance (N-C) diode with parasitics is analyzed. In Section V the 
maximum available gain (MAG) is calculated and is compared with the 
MAG predicted for an ideal device. In Section VI, an equation for the 
noise figure of an N-C diode is obtained, taking into account only the 
thermal noise of the series resistance. Over-all noise figures are given in 
Table I for three types of receivers with diode preamplifiers, operating 
at specified input and output frequencies. 


Il. THE EQUIVALENT CIRCUIT 


For the calculations to be made here, the p-n junction might be repre- 
sented by the lumped-parameter equivalent circuit shown in Fig. 1. 
Here R, and Cy are as defined and the “variable admittance’”’, shown as 
lumped parameters, accounts for the current carried across the junction 
by the motion of carriers through the space charge region. A calculation 
of the mixing action of the diode, using Fig. 1, would involve a rather 
complicated formulation.’ However, the silicon and germanium re- 
tarding-field diffused diodes now under investigation are fairly accurately 
represented by the equivalent circuit shown in Fig. 2. (The design and 
properties of these devices are discussed in other papers).’’”° In this 
diagram, Cmin represents the smallest capacitance attainable, determined 
by the limitation on reverse bias voltage imposed by breakdown, and 
Cmax is the largest value, determined either by physical limitations or by 
choice. 


Cr. 










DIELECTRIC 
CAPACITANCE 


SERIES 
RESISTANCE 


VARIABLE 
ADMITTANCE 


Fig. 1 — Equivalent circuit for nonlinear-capacitance diode. 
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OsC (t) S$ Cuax- Cun 


Fig. 2 — Simplified diode equivalent circuit. 


III. ASSUMPTIONS FOR CALCULATION OF GAIN AND NOISE FIGURE 


In this article the gain and noise figure of an N-C diode frequency 
converter are calculated according to the following assumptions: (1) 
Upper sideband operation is assumed; that is, an input frequency /j; is 
converted to an output frequency fo using a beating oscillator at the 
frequency fo — f;. (2) The lower sideband, of frequency fo — 2f1, and 
all the harmonic images are assumed to be open circuited, implying that 
no power is delivered by the frequency converter at these frequencies. 
(3) The equivalent circuit of Fig. 3 is used. Here, filters are indicated 
which restrict transmission in the left-hand branch to a narrow band 
about the frequency f; and transmission in the right-hand branch to a 
narrow band about fe ; & and & are signal voltages and &y; and &y2 are 
noise voltages at frequencies f; and fe respectively. The equivalent cir- 
cuit of the diode which was discussed in connection with Fig. 2 is as- 
sumed applicable over the operating range of pump voltage and frequency. 


BANDPASS BANDPASS 


FILTER FILTER 






NOISE 
INPUT OUTPUT 
IMPEDANCE GENERATORS IMPEDANCE 


SIGNAL 
VOLTAGE 


SIGNAL 
-) VOLTAGE 


Fig. 3 — Equivalent circuit for the calculation of gain and noise figure. 
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Using these assumptions, results have been obtained in a simple form 
which can serve as a guide to diode development and application. It is 
not supposed that the model used here constitutes a very accurate charac- 
terization of the frequency converter operation. In particular, assump- 
tion (2) is theoretically possible, but there would be considerable prac- 
tical difficulty in satisfying these conditions in a microwave circuit. 


IV. GENERAL EQUATIONS FOR FREQUENCY CONVERSION 


In subsequent sections we shall need to calculate, for the circuit 
shown in Fig. 3, the power delivered to the load at frequency f2 when 
specified signal voltages at frequencies f; and f, are applied. Hence, 
what is desired is a means of calculating the current 72 for given signal 
voltages. For the specific applications to be made later on, we shall ob- 
tain in this section a solution in general form for the currents 7; and 72 
resulting when signal (or noise) voltages & and & , at frequencies f; and 
fo respectively, are applied to an N-C diode. 

It is possible to express the relation between currents and voltages of 
a frequency converter by means of an admittance matrix; this will be 
done here for a nonlinear-capacitance frequency converter. Let the beat 
oscillator voltage be represented by Vo, and the combined signal volt- 
ages by 6V. If noise is of any consequence, one may surely assume 
5V « V.. Hence we may write (1), representing the charge on the 
capacitor as a function of voltage, in the form of a Taylor expansion, 


QV) = Qv, + (2) ee (1) 


The subscript Vo signifies that Q or its derivatives have the value deter- 
mined by Vo alone, i.e. 6V equals 0. It is assumed that terms of higher 
order in 6V are negligible, so that the relations to be derived will be 
linear in the signal quantities. (For a more complete theory of linear 
frequency converters, see Ref. 6). 
To evaluate (1) we expand (0Q/d@V),, in a Fourier series, 
dQ 


_ ws os = gnbt 
es C(t) + Cnin 2a Ces (2) 


with the beat oscillator voltage assumed to be periodic, with pulsatance 
(angular frequency) b. The signal voltage 6V may involve all the har- 
monics of the beat oscillator pulsatance mixed with the signal pulsa- 
tance s. Hence it is written 


io} 


6V = z= > tere (3) 


m=—o st 
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Referring to (1), it is seen that the first term on the right-hand side con- 
tains only pulsatances which are harmonics of b. The second term, which 
we designate 6Q, contains the various signal pulsatances. It is given by 


5Q = (32) BV = LY Covmagreitomtmrtelt (4) 
OV Vo nym st 


Since we shall want to select individual pulsatance terms from the right- 
hand side of (4), we set m + n = lI. The result is 


60S DO Cie (5) 
lm st 
Also, being interested in the current components, we use the relation 
to = Jog. , obtaining finally 


bre = Dy JD + 8)\C hand moes (6) 


(we have here stopped considering the terms containing —s, because 
ips = 1-4. ). Equation (6) is equivalent to the matrix equation 


Libts = » Y mV mb+s ) (7) . 


where the elements Y;,, are given by 
Yim = j(lb + s)Crim . (8) 


The admittance representation is convenient to use when only a few 
signal voltages have specified values and the rest are zero (i.e. short- 
circuited). In the case of the equivalent circuit used here, none of the 
voltages can be short-circuited, because of the diode series resistance. 
The possibility remains, however, of presenting open circuits to all the 
image pulsatances (all the mb + s) except for the two involved in fre- 
quency conversion, s and b + s. This means that all the currents except 
7, and 7+; will be assumed to vanish. The relation between voltage and 
current can then be simplified when it is expressed in the impedance 
matrix form. Hence we must obtain the matrix inverse of || Y ||, which 
is |Z = YI. 

There is at least one case in which this matrix inversion can be readily 
performed. If éQ/dV equals Co + 2C; cos bt then only the coefficients 
Co and C; in the expansion (2) do not vanish. The matrix of (8) is now 
limited to elements on the diagonal and once removed from the diag- 
onal. As a product of infinite matrices (in these arrays only the nonzero 
elements are indicated), it is written 
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| ¥ |] = 
j(—b + 8) Ch Ct 
qs 
Op 05-G . (9) 
j(b + s) 
j(2b ae s) Ci Co Ci 








The second matrix on the right-hand side of (9) may be factored to 
obtain (10), 


Co dg 


I| Com {| = Co el % ; (10) 


gf 1 ¢ 


where ¢ = C,/Co. Equation (9) can be written symbolically as 


| Yl] = 7@b + 8) | I] Com ll, 
and, by a rule of matrix algebra, the impedance matrix is given by 
|Z Wl = YW = W Com I pb + 8) IO. (11) 


The first matrix on the right-hand side of (9) is diagonal, hence its in- 
verse is easily obtained. The inverse of the second matrix is also known.° 
If a matrix has the form 


Mi = 1s if ~= 7 
=i $f ge pad (12a) 
1+?’ 
= 0; otherwise, 


the inverse matrix is given by 





Bis 2.) dee e (dl 
(My = GE (9 (12b) 
This result can be used to obtain || Crm ||’, if we set ¢ = &/(1 + &) 
[see (10)]. If it is now assumed that all pulsatances in the spectrum, other 
than b + s and s, are open-circuited, the impedance matrix reduces to 
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= 0 
_ lite} 1 él Iie 
as —£ 1 1 ig) 
j(b + 8) 
To simplify future work, we shall use the abbreviated representation 

Zu Ziz 
A= : (14) 

Zn Zo 





The matrix Z, represents the capacitative impedance of the diode. 
The total impedance Z must include the series resistance F; it is given 


by 





Zu + R Zi 
Z= : (15) 
Za Zo + Rk 
By the definition of an impedance matrix we have 
v1 Zuthkh Ly 1y 
= (16) 
v2 Zo Zo + kh 1 














Here v; and v2 are the voltages across the diode terminals at the input 
and output frequencies, respectively, and 7, and 7 are the corresponding 
currents. It follows from Fig. 3 that these two voltages are given by 


v1 = & — Ze 











(17) 
V. = & — 1h t 
Substituting (17) into (16), and rearranging, we obtain 
& Zu + hk + Ze Lie ay 
= (18) 
&2 Za Zo + Rh + Z, 12 
The solution for the currents is written in matrix form 
14 Zu1+Rk+ Ze Ly * Il 8 
= ; (19) 
19 Zu Lx + R a Zr, &e 




















where the reciprocal matrix is given by 
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Zu +R + Ze Zi. i 
Za Ze + R + Zr 
(20) 
Ze + R+ Z, —Zy 
1 
a : 
—Z Zu + R + Ze 
and D is the determinant of the matrix: 

D= (21 + R + Za) (Zo. +R+ Z1) — ZyZ. (21) 


The output current, and hence the output power, may be determined 
from (19) when the voltages & and & are specified. This equation will be 
used as a starting point for calculations of gain and noise figure. 


V. GENERAL EQUATIONS FOR MAXIMUM AVAILABLE GAIN 


We shall calculate the gain of the frequency converter by using the 
condition 


ah. (22) 
& = 0; 
where & is the signal voltage. The gain G is defined by 
. . 2 
power absorbed in Z, _ || Re Z, (23) 


~ power available at input | & 2/4 ReZo’ 


We shall calculate G with the aid of (19), (20) and (22), and then deter- 
mine the values of Z¢ and Z, leading to a maximum G, designated by 
Gnade . 
We first solve for 72 by substituting the conditions of (22) into (19) and 

obtain 

, 1 

QW = ~D Zags. (24) 
Hence, for the gain we have 





| Zu 18? Ri/| DP _ | Zo |’ 
[eRe TD a 
which, with the aid of (21) becomes 
2 
G ARR, | Zar | (26) 


© [Gu FR Ze)Gn +R + Zt) — Dade 
To maximize G, we first impose the tuning conditions 
jim Ze = —Zy, gimZ,= —ZDo. 
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[Note that, according to (13), the quantities Z,, and Z2. are pure imagi- 
nary numbers]. As a result, we obtain 


7 4ReR,, | Zn | 
(hk ae Re)(h = Rx) a Zi2LZn\" 


Here Rg is Re Zz, Rr is Re Z,, and ZZ. is a real number (since it is 
the product of two imaginary numbers), and we may dispense with the 
absolute magnitude sign in the denominator. 

To obtain Gnmax we must maximize the right hand side of (27) with 
respect to Rg and R, . The values of R, and R, satisfying this condition 
are the matching values, and will be designated by R,’ and F,’. They 
may be obtained by solving the pair of equations 


G (27) 


0G 
ae 
‘ (28) 
0 
oR = 0. 
On solving these we obtain 
R’K’ + RR, 
Re = ——__———-, 
R+ Rt 
Paes (29) 
tae R’K’ + RRe 
= R+Re ’ 


where K = V/1 — ZZ2,/R?. These equations give the value of Rg 
which maximizes G for any selected value of R,, and correspondingly 
for R,. The matching values AR,’ and FR,’ satisfy (29) simultaneously. 
They are 


Ro = Ri! = KR. (30) 


The equality of the input and output matching resistors could have been 
predicted from the form of (27) for G, which is completely symmetrical 
in Rg and R, ; i.e. an interchange of Rg and FR, , whatever their values, 
does not alter the value of G. Thus, let us suppose that we determine 
R,’ and R,’ experimentally by trial and error, and assume that they are 
a unique pair. Interchanging them must leave G a maximum, hence 
R,’ equals Ry’. 
An expression for Gmax may now be obtained by substituting (80) into 
(27). We find, noting that ZZ. equals — R’(K’ — 1), 
AK?R’ | Zs |? | Zos |? 


Gmax SS SS a ee a 


[R(K FIP — Zuda RUK + 1)" (31) 


Equation (31) expresses the maximum available gain as a function of 
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the diode parameters R and Cimin , and of the time variation of the diode 
capacitance, through the quantities Cy and C,. For a given diode the 
first two quantities are assumed fixed, while the last two may be varied 
within limits set by their status as Fourier coefficients of a real, positive, 
periodic function of time. In the Appendix, Gmax is evaluated explicitly 
in terms of R and Cmin for the case that Z, is given by (13). The result 
is 


_h | 
Cm = FO (32) 


where 7/1 + 1/x? = K as introduced in (29), and 
c= A@C min R, 





with \ a numerical factor depending upon Co and C,, and 
Oo = V ww = Vs(b + s). 


To simplify future analysis, we write 





i) 
x=At, 33) 
ji ( 
where 
S= Voit 
and 
1 ; ; 
fi. = IaRCan a figure of merit for the diode. 


The expression for x in (33) is convenient for evaluating Gmax from 
(32). We shall first: consider low-frequency operation, where f is very 
much less than f,/A. Then x is very much less than 1, and 


Gace ~ fol fi . 


As x increases, Gmax/(fe/f:) monotonically decreases from its asymptotic 
value of 1. This is shown graphically in Fig. 4. Since x = Af2rR.C min , 
the best operation is obtained for minimum values of R,, Cmin and X, 
or a maximum value of f./A. In the Appendix it is shown that, for the 
type of operation under consideration, the minimum attainable value of 
d is 5.83. 

We next consider high-frequency operation, characterized by 
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f > f./X or x > 1. It is then convenient to rewrite (32) in the following 
form: 


Gans = fa ! = fe Ste os : 
4 y+) fiz? (1 + K) (34) 


As x approaches ©, (34) becomes 














lin Caio Gig ie a (35) 
igs ° "Af 2? 2rhfi 
Since we would like to have Giim as large as possible, it is evident that 
f-/X should be large, or, correspondingly, that R;, Cmin and » should 
be small, as was true for the low-frequency limit of operation. 
Equation (35) indicates that, for a given input frequency f; , the gain 
should approach a limiting value as fe approaches ©. This behavior is 
shown graphically in Fig. 5. It is noteworthy that Gjim decreases with 
increasing f, , and in particular that, for f; = f./2A, Giim S 1. This places 
a fundamental upper limit on the input frequency, if power gain is to be 
obtained. 
The above results show how the maximum available gain of a diode 
amplifier is affected by parasitics. Of greater significance in characteriz- 
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Fig. 4 — Ratio of MAG of a diode with parasites to the MAG of an ideal 
variable capacitance, plotted against the parameter z. 
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Fig. 5 — The ratio Gmax/Giim versus 2, the input frequency f; being kept con- 
stant while the output frequency f2 varies. 


ing its performance is the noise figure, which is discussed in the next 
section. 


VI. GENERAL EQUATIONS FOR NOISE FIGURE 


A p-n junction diode is thought to contain two sources of noise power, 
one being shot noise at the p-n junction, the other being thermal noise 
in the series resistance R of the bulk semiconductor. It has been shown’® 
that the shot noise of the p-n junction of a nonlinear-capacitance diode 
approaches zero so long as the frequency is high enough for recombina- 
tion to be negligible, and not so high that the p-n junction no longer 
exhibits purely capacitative behavior. Hence in the following calculation 
of noise figure only the contribution of the thermal noise of R is con- 
sidered. 
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The noise figure of an N-C diode frequency converter is calculated 
using the equivalent circuit of Fig. 3 and the assumptions of Section 
III, together with the additional assumption that the noise generators 
at the frequencies f; and fo are uncorrelated. 

The noise figure is calculated in accordance with 


ee 


noise output originating in thermal noise of R (36) 





noise output due to thermal noise at 290°K in source resistance 


Thus, as a “standard signal’, one uses the thermal noise output of the 
source resistance, assumed to be at a temperature of 290°K. The noise 
output of the diode is assumed to arise from two components of the 
thermal noise of R. One of these, the thermal noise emf &y;, sets up a 
current of frequency jf; in the left-hand loop of Fig. 3, which, by the 
frequency-converting action of the diode, is partly converted into a 
current 72 , of frequency fo ; this then passes into the right-hand loop and 
introduces noise power into the load. The other component, &y2, also 
causes a current iz, of frequency fe, to flow through the load. The 
noise power received by the load is determined by these two currents, 
which are assumed to be uncorrelated. 

The noise power can be readily calculated from (19) and (20). The 
current 72 due to &y; 1s given by solving 


1 Zn + Rk + Z, —Zy Ey1 
=5 (37a) 
12 —Z Zy + hk + Ze 0 
Similarly, the current 72 due to Sy2 is obtained from 
4 Zo + kh + Z, —Ly 0 
1 
TT (37b) 
Ie —Z Zu Fe R ae Za Ene 
Solving these, we obtain 
6m #0: bh = = Zn&n1 » (38a) 
Sie 08 SAC ReR Gs. (38b) 


~ D 
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The noise power delivered to the load is then: 


; 4AkTBRR 

Gd, = TBR Re |, + B+ ol + 1Zml), 0) 

in accordance with the assumption that the currents in (38) are uncor- 

related. Following (36), the noise figure is given by 
(is )Ry, 


BF 1 = a00KBG? 





(40) 


where 290kKB is the available thermal noise of the source resistance and 
G is the gain of the network under the same conditions, defined by (23). 
The gain was calculated in Section V, with the result in (25), given here 
by 





G = 4ReR, | D P . (41) 
Combining this with (89) and (40), we obtain 
2 2 


Re | Zax |? 290 ~ 


The noise figure here depends upon the diode structure, through RF, 
Zu and Z» ; upon the beat frequency oscillator waveform and the operat- 
ing frequencies, through Z,, and Za, ; and upon the input termination, 
through Z, . (Note that F is independent of the load impedance.) Even 
if all the other quantities are fixed, considerable variation in performance 
can be obtained by varying Zz; . In this presentation, the noise figure is 
calculated for the value of Z_ which gives maximum available gain, and 
for the value giving minimum noise figure. 

The noise figure corresponding to the condition of maximum available 
gain is obtained by inserting the appropriate value of Z¢ into (42). In 
Section V this was shown to be Zg = KR — Zy. Equation (42) then 
gives 
1R(K +1)? + |Z)? 7 
K | Zor |? 290° 


On the other hand, from (81) we have 


| Zor |’ 
RAK +1)" 


| ae (43) 





Gmiag = 


Hence, (43) may be written 


1 ff 
r-1=2(14+-2)2. (44) 


— 
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Before proceeding further, it is interesting to note that (44) can be 
obtained by a simple intuitive analysis, making use of (36) directly. We 
assume that the diode series resistance R generates noise emfs equal to 
V/4kTBR at the two frequencies f; and fe. Due to the emf at fre- 
quency f2, the load receives noise power proportional to R. The noise 
power generated at frequency f; is also proportional to R; it is converted 
to frequency fo with a gain G, so that the load receives additional noise 
power proportional to GR, making the total noise power from the diode 
proportional to (G + 1)RT. This is to be compared with the noise re- 
ceived at the load due to a matching resistance at the input. This match- 
ing resistance Rg = KR, and the corresponding noise power received is 
proportional to 290GKR. Hence, according to (36), 


ee S| 1\T 
W-1=“s00GKR 7 & Pr Ge 290 ° 


Equation (44) can be expressed in a more useful form by substituting 
an equivalent expression for K. From (84) and (85) it follows that 








4 
Giske =, Giim (1 + Kp ° (45) 
Hence 
a Giim a 
k=2 ( Gu | 1. (46) 


Substituting this into (44) gives 


1 1 T 
Fm 1+ ge — (1+ a) ee 
2(% =) — 1 max 


This is the noise figure corresponding to a matching input impedance. 
In using (47), Gmax and Giim may be determined for any set of operating 
conditions, with the help of (34) and (35). Two limiting expressions ob- 
tained from (47) are of interest: 


as Liane < 1 ) T 
Gniiis <K Gite? F = 1 ae 2 (Gus) 1 + Ce 590’ (48a) 


T 
) 590° (48b) 





Gmax & Giim: pei+ (1 gh 

Gina 

Hence, when Ginax 18 very much less than Giim , F is not much greater 
than 1. When Gmax approaches Giim , / becomes approximately equal to 
to 2. Thus, even under the most unfavorable conditions, the theoretical 
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noise figure will not be much more than 3 db if there is a significant 
amount of gain. 
The minimum noise figure may also be found from (42), and applying 


first the tuning conditionj7X¢ = —Zy , we obtain 
R (R + Re)? + [Za 7 
fo = ee OO _ — 7 4 
i Re | Za |? 290 —) 


The value of Rg making F a minimum can be found from (49) by the 
standard procedures of differential calculus. The appropriate value of 
Re , represented by Rg”, is 


Ro” = (|Zu[? + RY)”. (50) 


Here Ginax 18, as before, the maximum available gain, obtained with the 
matching value of Re. On comparing (31) and (45), one determines 
that 


| Zu |’ = AR’Giim . | (51) 
Substituting (50) and (51) into (49) gives 


(4Gtim + "+1 7 


Poin = 1+ Grim 390 


(52) 





~., Tf 1 

ae mo Vae tone -), 

The approximate form is valid when Gjim >> 1. Equation (52) indicates 
that a noise figure of close to zero db is theoretically attainable. 

It is also important to determine the maximum gain Gy attainable 
under the conditions leading to minimum noise figure, as Gy may be 
needed to calculate the over-all noise figure of a system using a diode 
preamplifier. To obtain Gy , we substitute into (27) the value Rg = Re,” 
given by (50), together with the value of R, that makes G a maximum. 
This quantity, represented by R,”, is found from (29). In the present 
case it is: 


_ RK’ + RR.” 
R + Rae 
Substituting (53) and (50) into (27) gives Gy . Making use of the defi- 


nition K = VY1 — ZypZn/R? and | Zu = 4R’Giim given in (51), we 
find 


fi,” (53) 


4 Giim 


Ces + R/Re’ Ro’ /R+ EK’ 


(54) 
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Using the relation K = 2(Giim/Gimax)"” — 1 given in (46), equation (50) 
and the assumption Gjim >> 1, we obtain the approximate expression: 














AGiim ee 
Gy = ——— Giim us | 1 . (55) 
2VGhim + }2 (Gm ) 7 : i v 2S Giim 


We now consider tio special cases. For Gmnax sufficiently small, 


. 1/2 . ———s 
(Gm ) SS 1, and in > V/Giim ; 








Ginax max 
Therefore 
Gy & Ginax - (56a) 
For Ginax = Giim 
Gy & 2V7Giim - (56b) 


The equations obtained for the noise figure have been evaluated for 
certain receiver systems, and the results are given in Table I. The exam- 
ples include, respectively, an N-C diode up-converter followed by (1) a 
travelling-wave tube amplifier operating at 3 kme with a noise figure of 
4.5 db; (2) a point-contact superheterodyne converter plus IF amplifier 
stage operating at an input frequency of 10 kme with a noise figure of 
7.0 db; (3) a system similar to (2) operating at an input frequency of 
55 kme with a noise figure of 10 db (a value that may seem optimistic 
but is probably attainable by combining the best point-contact diodes 
with the best 1r amplifier). Values for over-all noise figures of each system 
are obtained from equations (47) and (52), and compared; these corre- 


TaBLeE I—NoIse Figures FoR CERTAIN RECEIVER SYSTEMS 
fe = 160 kme A = 6.0 T = 290°K 





Over-all Noise Figure of System, 
system 





Reecee Output Noise 





System Uhm |G | TE SGBme | TRUS me” 
’ FA FR FA FB 

(1) 3 4.5 1.40 1.40 2.51 2.48 
(2) 10 7.0 1.24 1.20 2.15 2.20 
(3) 55 10.0 1.20 1.14 1.94 2.10 


F4 = noise figure of system corresponding to maximum available gain for diode. 
Fs, = noise figure of system corresponding to minimum noise figure for diode. 
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spond to the conditions of maximum available gain and minimum noise 
figure, respectively. The difference between them is not significant in 
most cases. The calculations were made assuming \ = 6.0, 7’ = 290°K 
and f, = 160 kme, the latter figure being obtainable with recently pro- 
duced diffused silicon diodes. 


VII. CONCLUSIONS 


Equations for the maximum available gain (MAG), and the noise 
figure have been obtained for a nonlinear capacitance diode containing 
parasitics. Because of the series resistance and minimum capacitance, 
the MAG reaches a limiting value as the output frequency increases 
without limit. The thermal noise of the series resistance is also responsi- 
ble for noise output of the diode. In contrast, a perfect variable capaci- 
tance would have its MAG always equal to the ratio of output frequency 
to input frequency, and it would generate no noise. 

The results obtained depend upon the diode cutoff frequency f, , and 
upon the operating conditions through the parameter \. Values of f. up 
to 200 kmc have been obtained with recently prepared diffused diodes, 
hence the use of the value f, = 160 kmc is not overly optimistic. 
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APPENDIX 


Derivation of the Equation for Maximum Available Gain 


In Section V the maximum available gain was obtained in the follow- 
ing form, from (31): 
_ |Z 
RK + 1)?’ 
where K = V/1 — ZpZa/R?. It is convenient to define the quantity x 
such that K = +/1 + 1/22. Hence 


Ginae (57) 


2 R? 
=-7sz 58 
Fe — DypLy a 
After (58) has been solved for R’, (57) can be written 
Be cenh | Zo |? ik ~ | Zor |? 1 | (59) 
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The impedance matrix elements, obtained from (13), are 














1 +E E * 
Zu = 17 
2b +s 
met —P . Gi 
Then 
_ b + Ss 1 _ a 1 
foe ee Laer aGtveei e 
where 
r= hae aRCo (62) 
&(1 + &) 


The quantity Ginax in (61) is the gain obtained with matching termina- 
tions. It depends upon the diode parameters R, and Cinin , and upon the 
operating conditions through the operating frequencies and the Fourier 
amplitudes Cy and C; . The diode parameters are fixed for a given device, 
of course, but Co and C,; may be varied to further maximize G. To demon- 
strate this, we first express the variables Co and C, in terms of Cmax and 
Cin, Where Cinin iS 2 Constant, namely, the lowest value of capacitance 
obtainable. [From (69), below, it can be seen that, for any given value 
of Cmax, the gain increases with decreasing Chin]. We therefore need 
consider only Cymax to be variable. For convenience, in the calculation of 
maximum gain, below, the variable yz is used, where p” = Cmax/Cimin - 

The process of maximizing G is equivalent to minimizing x. To ac- 
complish this we start with (62) and express the quantities é and Co in 
terms of » and Cnin. From (12a) we have 


£703 g 


~@ 1+ 
Since C(¢) is a sine wave, it follows that 
‘Co — Cmin = 2C1. (64) 
Equations (63) and (64) may be used to eliminate Ci , obtaining 
Cy = TES Oi (65) 
Next, we express £ in terms of u. As C(¢) is sinusoidal, it also follows that 
Cmax = Co + 2C1. (66) 


Combining (64), (65) and (66), we obtain 


9 ee aly. 
Cae = (}+2) Crain (67a) 
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or 





Solving for &, we obtain 
sed 
‘ p+l 
Finally, combining (62), (65) and (68), we get 


port min 9 
or 
, = we +) 
p—-l 


The minimum value of A is found to be 
N= (1 + V2)’ = 5.83 
for . 
p= l+ v2. 
The corresponding value of Cimax/Cmin 1S 


Crain 


ae =p = (14+ V2) = 5.83. 





(67b) 


(68) 


(69) 


(70) 


(71) 


Thus, the maximum gain and minimum noise figure are obtained for a 
variation of capacitance over a 5.8-to-1 ratio. In practice, however, the 
dynamic range of capacitance may be no greater than 3 to 1. This corre- 
sponds to p = +/3, and A = 6.5. Hence, for this limited range the per- 


formance would still not be far from optimum. 
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Nonstationary Velocity Estimation 


By T. M. BURFORD 


(Manuscript received January 3, 1958) 


A nonstationary noise may frequently be approximated by the product 
of a stationary noise and a deterministic function of time. From observa- 
tions of the sum of such a nonstationary noise and a linear signal, an esti- 
mate of the rate of change of the signal is found. More exactly, a random 
function, x(t), 7s asswmed to be one of the following: 


a+ bt + g(t)n(t), 


a + bt + g(t) [ h(t — r)n(r) dr, or 


at bt + [nit = DgG)nle) dr, 


where a and b are constants, h(t) 1s the impulse response of a lumped parame- 
ter filter, n(t) 7s white noise and g(t) ts a nonzero deterministic function. 
A least squares estimate of b 1s found as a lincar operation on a finite sam- 
ple of x(t). 


I. WHITE NOISE CASE 


A well-known estimation problem is that of forming-a mean square 
estimate of the constant, b, in the random function x(t) = a + bt + n(d), 
where n(¢) 1s white noise and a and b are unknown constants. The esti- 
mate of b is required to be the linear operation 


= | ee ee 


where K(z) vanishes outside (0, 7’) and 6 is to equal b in the absence of 
noise. The solution is known'” to be 


6 
K(z) =—(f — 22); 0S2sT 
™ (1) 
=. 0; elsewhere. 
1009 
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The above problem will be generalized here to include one type of non- 
stationary noise. 
Let 


x(t) = a+ bt + g(nd), (2) 


where a and b are unknown constants,* n(é) is white noise of unit spectral 
density and g(t) is a nonzero, deterministic function. Formally, it fol- 
lows that 


Elg@n@Og@/)n@)] = gOg@)s — ¢). (3) 
We wish an estimate of b in the form 


6 = [. K(t, t — r)a(r) dr, (4) 


where K(é, z) vanishes for z outside (0, 7’). A constraint is that, in the 
absence of noise, (4) should give b exactly. Therefore 


t 
b= [| Kit-D@t bv) dr, 
t—T 
which implies, with a change of variable, that 
T 
[ KGa & =0, (5) 
0 
and 
T 
[ KGa a= -1 (6) 
0 


Using (8), (5) and (6) and again changing variables we find that the ex- 
pected error is now 


; 
Bb — b)? = I K°(t, dg — 2) dz. (7) 
0 
The minimization of (7) is easily done by the usual variational tech- 
nique, using (5) and (6) as isoperimetric constraints. The variation gives 
2g°(¢ — z)K(t,z) +X + pz = 0. 
Therefore 


—r — pez 
SgE = 2)” (8) 


* The linear a + bé is used here for simplicity. The coefficients of a general 
polynomial may be estimated in a similar way. 


KG, 2) = 
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where A and yu are undetermined multipliers to be fixed by (5) and (6). 
From (5), 


[ Kuda = aan His -4[ “5 7 a 


and, from (6), 
st pe ee eee ee 
2Jo gt,2) 2h PE-—z) ~ 


Solving the preceding equations for \ and yp and substituting into (8) 
gives 


T 
| 2K (t, 2) dz 
0 


Ay = zAg 
Kit, 2) = ———_ 3,3 O8285T 
6) = a — AdeC= a) ° (9) 
= 0; elsewhere, 
with 
7 edz 
A,(t) = I Pa 2)’ 
which gives the minimized error 
eyo Ao 
E(b b) = Avhe SA 


The denominator of K(¢, z) does not vanish, since g(t — z) is assumed 
nonzero, and 


Ay 
Az — Ao 


Ads = AP 0. 


In general, the K(t, z) found here defines a time variable filter over a 
finite time interval. However, if g is a constant, (9) reduces to (1). An- 
other interesting special case is that in which g is assumed to be of ex- 
ponential form, which implies that 


g(t — z) = g(tg(—2), 
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and therefore 
= zi dz =, - Bj 
o a0 g(—z) P(t)" 
Substitution into (9) gives 











B, emery zBo 
tins a IQS < T 
(BB, — BYg(—2’ == 


= 0; elsewhere. 


K(z) = 


Now, however, the B,;’s are constants instead of functions of ¢, as the 
A,’s were. lor an exponential g, therefore, the filter given by (9) is time- 


invariant. 
The function g(¢) is not necessarily continuous. For example, let 


g(x) =; r<B 














= 9; x£> B. 
Then 
1 pi 
Ap 2p 7 
J ge j + 1 ? B 
1 pi ( 1 1 ) a 
= - alee) Pe a ea ba a ea 
ej+it gf grit ‘ 
1 pi 
SS : >t 
gg FA eo 
which implies that K(t, z) is linear but has a discontinuity in value and 
slope at z = 8B. 


Many applications involve a linear g(t). For this case, it is possible to 
plot a one-parameter family of curves which completely describe K (é, z). 
If g(t) equals a + Bt, we define Q = —(a +4 £t)/Bt. Then, by a change 
of variable, 

Hl plo j-l 
Fire ey glen Ean ee oe 
BP Jo Q+s)P BP 


and substitution into (9) gives 
a-a() 


T’K(t, 2) = (CoC — C2) [0 a (7) 
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All of the time dependence is now included in Q. Fig. 1 shows T’K(t, 2) 
plotted against z/T for several values of Q. A positive value of Q implies 
that | g(¢) | is decreasing with time and therefore that recent data are 
superior. Values of Q in (—1, 0) are not permitted, since such a Q would 
imply that g(t) vanished somewhere in (¢ — 7’, ¢). If g(d) is constant, Q 
is infinite and (1) applies. 



































T? K(t,Z) 





















































O- 0.2 0.4 0.6 0.8 1.0 
2/7 


Fig. 1 — Weighting functions, K(é, 2), for linear g(t). 
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The K(é, z) just found for the linear g(¢) case may be substituted into 
(7) to give the variance of 6 as a function of Q: 


fue ae E 2 (toe a 1) Q(Q + »| (10) 


An interesting comparison may be made with the variance resulting 
from using K(z) of (1) in (7), which gives 





ait — i = 30041 +2. a) 


Equation (11) shows the error to be expected from using a K(a2), 
which is optimum in the stationary case, on this type of nonstationary 
data. 

Equations (10) and (11) are plotted in Fig. 2. The difference between 
the curves is a measure of the improvement possible* with time-variable 
smoothing. 

As Q approaches ~, (10) may be written 


i, ae 2 7 LNs, 
pH — b)* = 3020 +1) i +0(5); 
therefore (10) and (11) will asymptotically differ by 16/5. 
Tig. 2 is plotted for Q > 0; however, it may be used for Q < —1 since 
both (10) and (11) are even on either side of Q = —0.50. 
Fig. 1 also suggests a simple approximate realization for K(é, z). The 
several curves have approximately common intersections near z/7' = 0.2 
and 0.8. Therefore, A(t, z) might be represented as 


where K,(z) is one of the K(t, z) curves near the middle of the range of 
Q’s to be considered. The function K2(z) vanishes near the points 
2/T = 0.2 and 0.8 and takes on relatively small values elsewhere. This 
approximate realization thus involves two time-invariant filters and a 
multiplier which depends on Q and thereby on é. 


II. PRE-FILTERING OF NOISE 


Filtering and multiplication by g(¢) are not, in general, commutative 
operations, so a choice must be made between 


g(t) a _ Mt = *)n(e) dr (12) 


* It has been shown by E. N. Gilbert that, even if a linear g(t) vanishes in 
the smoothing interval, the best possible estimate has variance 6?/T and, there- 
fore, perfect estimation is not possible. 
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Fig. 2 — Variance for linear g(é). 


and 
is h(t — 1r)g(r)n(r) dr (13) 


as noise functions to be added to a + bé. In certain cases, the representa- 
tions (12) and (13) are virtually equivalent in the sense that, given 
h(t — 7), there exists an h(t — 7) such that 


gd) [he = 2)nGe) dr = [fle — rgl)nG) ar. 


Trom the point-wise uncorrelation of n(7), the preceding equation, if it 
is true at all, must be true for all values of 7. Therefore 


h(t — 7) = 9) ny — 1), 
g(7) 
which implies that g(¢)/g(7) must be a function of (¢ — 7). The only 
nonconstant g(¢) satisfying this condition is the exponential. If g(¢) is an 
exponential, the choice between (12) and (13) is arbitrary. In general, 
however, the choice is not arbitrary and depends on physical considera- 
tions. The forms (12) and (13) will now be treated as Cases 1 and 2. 
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Case 1 
The observed quantity, x(é), 1s 


t 
x(t) =a + bt + g(t) i h(t — r)n(r) dr, 
where n(r) is white noise of spectral density unity. The transform of 
h(é) is assumed to be 


x 
T(h(t)] =————-; m<n. 
> 


It is also assumed that the numerator and denominator of T|h(¢)] have 
neither common factors nor multiple roots. The covariance of 


t 
[ne = 2)n(e) ar 
is defined to be p(7). A particular K(¢, z) is sought such that 
t 
b= | Kea aiaevar 

t—T 

minimizes 
“3 T z 
E(b — b)’ =| dz g(t — z)K(t, a | dz’ g(t — 2’) K(t, 2’)p(z — 2’). (14) 
0 0 


In obtaining (14), it was assumed that, in the absence of noise, b 
equals b, or 


[ xe z) dz = 0 [ -xe z)dz= —1. (15) 


Using the constraints of (15), the variation of (14) gives the integral 
equation* 





7 , , , , d + M2 
dz’ K(t, z')g(t — z)p(z — 2) = ———. (16) 
0 g(t iF, z) 
From the form of 7[h(Z)], it is evident that 
, — 1 - M(w) tw(z—2’) 
al a Nl) * Si 


* A similar equation, of somewhat different origin, has been considered by 
Ule.? 
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where M(w’) and N(w’) are polynomials in w” of degrees m and n. Now 
form’ the operation N(—d’/dz’) and apply it to (16), giving 


i [axe z)g(t — 2’) i : Mw)???” dw = N (-4) 
Qr 0 : — dz? 
which is, formally, the same as 


1 d [ ee oie een ae (-4)- + 
ar a 0 dz’ K(t, 2’)g(t 2’) ae dw = N da g(t — 2)’ 


or, the differential equation 


a A+ pz 
M (-4) Kit, 2g — 2) = v(- £) Ck ae (17) 


The general bounded solution of (17) will not ‘necessarily satisfy (16), 
however. As is well known,’ singularity functions up to order’n — m — 1 
should be added at the end points of the interval in order to satisfy the 
boundary conditions of the integral equation. (16). 

Therefore, let 


KG, 2g — z) = S aj?” +[ k(z — N(- \C ES ex 


n—m—1 —m—1 
+ >) 06%) + >> c6 (2 — T), 
0 0 


A+ pz 
g(t — 2)’ 





(18) 


5° 5 


where is the Dirac delta function, its derivative, etc. The func- 
tion k(z — x) is the Green’s function associated with M(—(d’/dz")) and 
the 2m constants m, are the roots of M(—X°). The a,;, b; and ¢; com- 
prise 2n undetermined constants. They may be determined by substitut- 
ing (18) into the integral equation (16) and requiring an identity in the 
2n exponentials of p. For example, assume h(x) = € “, which implies 
that 


M (w’) _ 1 
N (w?) ~ Qt a?’ 
Then 


KU, aot — 2) = (a — SYA + nate) + cale - 1, 
and substitution into (16) gives the following equations for bo and c : 


= Lo A + pz = ad A + pz 
bo = (a as |. = ” (0+ 22+ 








= T. 
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Except for evaluating \ and u from (15), the solution is 
Kt, z) = 


1 2 a d d\|\ + ne 
seal ~ dat too - g) +e - a+ 3) ees. 


Case 2 





The observed quantity «(¢) now is 
; t 
ait) =a+bt+ [ h(t — r)g(r)n(r) dr. 
We define b, K(t, z) and ht — r) as in Case 1 and again require that» 
in the absence of noise, 6 equal b, or that the constraints (15) hold. 
These assumptions lead to the following integral equation: 
T 
| dz’ K(t, 2’)@(t — z,t — 2’) = A+ pz, 
0 
where 


a(t — 4-2) = I hy)h(y + 2’ — 2gXlt — 2! — 9) dy. 


Instead of dealing directly with the integral equation, we first rewrite 
x(t) as 


wo) = [Re = 218 + 47 + g(e)nG)] ar, (19) 
where 
5 
Y Bo” 
and 


since it is easily shown that 


i h(a) dx = Bo ; 
0 Qo 
and 


I ei) ae = PI 
0 


? 
Qo” Qo 
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from the assumed form of 7 [A(é)], and that 


[ na-n6+ 9) a= 


Bf n(x) de + yt [ h(v) de — y | h(x) dx = a + bt 


when the values of 8 and y are substituted from (19). 
A function y(t) is now defined as 
y(t) = B+ vt + gn 
and a function K(é, z) is defined as the convolution of K(é, z) and h(z), 
so that (14) is replaced by 
b= f : R(t, t — nyo) dr. 


The function y(¢) is similar to x(é) as defined in (2). Therefore, K(é, z) 
may be found in a manner analogous to that used in the case in which 
the noise was not pre-filtered. Recalling that K(t, z) is required to 
vanish outside (0, 7’), we may define K(é, z) more exactly by 


K(t, 2) = i dvK(t, vh(e —v); 0O<2<T 
0 


T 
= i dvK(t, vh(ze — 0); 2>T (20) 
0 
= 0; z2<0, 
or the equivalent: 
n a? = m di 
Do 03 5 RW 2) = 2B 7 Kb 2) 0<2<T (21) 


= 0; z2>T7, z2<0. 


Since K(é, 2) may be discontinuous at z = 0 and 7, these points have 
been excluded. While K(¢, z) is constrained by (15), it follows that 
K(t, 2) must satisfy 


i K(t, z) dz = 0 


(22) 


[ 2K (t, 2) dz = Bo 
«0 


Qo 
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The variance to be minimized is now 
Ho - = | RG ag - 2 de 
From (21), 
K(t, 2) = > sels 2 > T, (23) 


where the s; are presently unknown but will be selected to minimize 
E(b — b)’ and the ¢; are known in terms of the a;. A variation of 


[ [K(t, z)g°(t — z) + AK(t, z) + weK(t, z)] dz 


gives 
2K(t,z)\g(t —2) +X tye =0; O<2<T (24) 


and a set of n equations defining the s; : 
n-1 oo) 0 
278 : FOP (t — 2) dz + i (\ + peek? dz = 0, (25) 
7=0 T T 


where 
k=0,1,-:-,n—-—1. 


Equations (23) and (24), together with the constraints (22), define 
K(t, z) completely. The function K(t, z) may now be found from (21). 

Irom the discontinuities in K(¢, z) at z = 0 and T, delta functions 
of order as high as n — m — 1 may be expected in K(f, z) at z = 0 and 
T. For example, if h(a) equals € *”, (21) becomes 


ee + aK(t,z) = K(t,z2); 0<2<T 
= 0; z2<0, ae 
and 
Ki, 2) = se”; 22> T. 
From (24), 
_ —A—pzZ 
Kt, 2) gE! 0<2<T 


and, from (25), 


2s [ e “g(t — z) de + / (A + uzle ’ dz = 0, 
T T 
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which defines s and therefore K(t, z) for z > 7. Except for evaluating 
dh and yp from (15), the solution is 


Kh.) = Ee ee ee tree a] (Ai) 


g(t — 2) 
+ u(r +3) 
caer 7/75 0 ae T), 


where 


Hii, T) = &" / e “g(t — z) dz. 
T 
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Amplitude Modulation Suppression 
in FM Systems 


By C. L. RUTHROFF 
(Manuscript received January 8, 1958) 


Inadequacies of some limiter concepts are discussed and a lamiter analy- 
sis an terms of low-index modulation theory is presented. The analysis of 
a particular circuit proves the possibility of perfect amplitude modulation 
suppression with small loss to the frequency modulated signal. Experimen- 
tal data are presented which verify the theory and demonstrate the practt- 
cability of the device analyzed. 


I. INTRODUCTION 


Since the early days of frequency modulation (Fm) it has been known 
that to realize the full advantage of rm with respect to signal-to-noise 
ratio it is necessary to suppress any amplitude modulation (am) which 
may be present on the signal. Various kinds of limiters have been used 
for this purpose: overloaded amplifiers, grid and plate clippers and 
vacuum diode clippers, to mention a few. In recent years, the trend has 
been to use germanium or silicon diodes for this function, particularly 
in broadband systems such as the TD-2 and TJ microwave systems and 
in systems using transistors. 

There is some ambiguity in the use of the word “limiter.”’ In this 
discussion a limiter is a device whose function is to reduce the index of 
amplitude modulation of the input signal. This function is called limit- 
ing or AM suppression and the amount of limiting is the ratio of the 
index of Am in the output signal to the index of am of the input signal. 

The operation of limiters has been only partially understood and as 
a consequence the operation is not efficient either in the sense of loss 
to the FM signal or in the amount of amplitude modulation suppression 
(limiting). 

If it is recognized that limiting is a modulation process, it becomes 
clear that modulation theory can be used to analyze limiter circuits. 
This is done here for a diode limiter. Theoretical and experimental 
curves are presented, as are equivalent circuits. 
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II. GENERAL 


Before launching into the modulation theory of limiters, it is first 
necessary to point out some of the inadequacies of some current con- 
cepts. 

It might be surmised that, if the carrier output voltage is plotted as 
a function of the input voltage, the amount of limiting is given by the 
slope of this curve. In Appendix B it is shown that this is not necessarily 
true. In particular, it is demonstrated that the limiting characteristic 
can be altered without altering the carrier transfer function. 

A common “clipper” limiter and its dual are shown in Figs. 1(a) and 
1(b). The customary explanation of limiting by the circuit in Fig. 1 (a) 
is accomplished with the aid of Fig. 2. The diodes are assumed ideal in 
the sense that the back resistance is infinite and the forward resistance 
is zero. The bias current J) is assumed to come from a constant current 
source and to bias both diodes in the forward or low-resistance direction. 
The input signal voltage e shown in Fig. 2(a) is a sine-wave amplitude- 
modulated carrier. The output voltage across r; follows the input volt- 
age until the input voltage reaches the clipping level a, Fig. 2(b). At 
this point, diode Dp; switches to its back resistance and further increase 
in input voltage does not appear in the output, since the back resist- 
ance of D; is infinite. Now, when the input voltage is reduced again 
to value A, D; switches to its forward resistance and the output voltage 
is again a replica of the input. A similar sequence is followed on the 


D; A D2 






Ry 
Io 
CONSTANT 
CURRENT 


Ry (a) 


CONSTANT ) = 
VOLTAGE 


Fig. 1 — A common idealized limiter circuit (a) and its dual (b). The diodes 
are pee to have zero forward resistance and infinite back resistance. 
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negative half of the cycle except that, in this case, it is the diode De 
that switches at level —a. The net result is an output voltage of the 
form of Fig. 2(c). If the clipping Jevel a were reduced and made to 
approach zero, only the zero crossings of the original signal would ke 
preserved and the amplitude modulation would be completely sup- 
pressed. 

The above explanation is useful for some purposes, but it has these 
serious limitations: 

1. It is not obvious from Fig. 2(c) that any am remains in the signal 
when, in fact, there may be a considerable amount. The fact that the 
usual envelope structure has disappeared does not prove the absence of 
AM. 

2. The output is as shown in Fig. 2(c) only if the limiter circuit has 
a bandwidth which includes many harmonics of the carrier frequency 
and the baseband frequencies of interest. These harmonics are not negli- 
gible in amplitude and have important effects on limiting. In most 
practical circuits only the fundamental is passed and the simplified ex- 
planation is not accurate enough. The effects of preventing the flow of 


(a) 
INPUT SIGNAL 


CLIPPING 
LEVEL 


(b) 
INPUT SIGNAL 
SHOWING 
CLIPPING LEVEL 





A 
(Cc) 
OUTPUT SIGNAL 
-A 


Fig. 2 — Input and output signals for the limiters of Fig. 1. 
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A=——— FUNDAMENTAL, B= 35K 
B—-— 1ST & 3RD HARMONICS, B = 35K 
1ST 3RD & 5TH HARMONICS,B = 35K 
D—--— ALL HARMONICS, B = 35K 
E ALL HARMONICS, B= oo 



































LIMITING IN DECIBELS 











SOURCE 


DIODE FORWARD’ RESISTANCE =20 
DIODE BACK RESISTANCE= Bn 
Ry ,=1000N I9=1073 AMPERES 

















=6 -4 —2 0 2 4 6 B i 2 14 #4 18 
AVAILABLE INPUT POWER IN DBM 


Fig. 3 — Calculated data for “clipper” limiter passing various harmonics of 
the carrier frequency. 


harmonics of the carrier are shown in Fig. 3. This figure gives limiting 
as a function of available input power level. As mentioned previously, 
limiting is defined as the ratio of amplitude modulation index of the 
output signal to that in the input. Curve a shows the usual practical 
case, i.e., where only the band of frequencies about the carrier and the 
baseband frequencies are allowed. All harmonics of the carrier and side- 
bands are suppressed. Curves B and c include all harmonics up to the 
third and fifth respectively. Curve p includes all harmonics of the car- 
rier and sidebands. Curve £ is for diodes with infinite back resistance. 
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This is the case usually analyzed. These curves were calculated from 
formulae derived in Appendix A; the circuit constants used are indicated 
on the figure. 

3. The third limitation has to do with the assumptions concerning 
the diodes. From Fig. 2(c) it can be seen that, if the clipping level is 
reduced (this is equivalent to increasing input amplitude and keeping 
clipping level constant), the AM suppression increases, as does the loss 
at the carrier frequency. In the limit, the am suppression is perfect and 
the loss to the carrier is infinite. It is commonly thought that, for this 
reason, increasing the input level increases the am suppression. In a 
practical limiter this is not true. One important reason is that the back 
resistance of a diode is never infinite. In Fig. 4, which is similar to Tig. 
2 except that the diodes are assumed to have a finite back resistance, 
note that the output waveform is slightly rounded instead of being 
clipped sharply. The portions are rounded because, even when one diode 
is on its back resistance, the input voltage influences the output voltage. 
In fact, during the clipping interval the output waveform is simply a 
greatly reduced replica of the input waveform in that interval. Now, if 
the clipping level is reduced to zero, the output waveform is exactly 
the same shape as the input waveform. At every instant, one diode is 
on its back resistance and the other on its forward resistance or, in 
other words, the circuit is lear and there is no limiting whatever. The 
loss, however, is finite. This means that there is some one input level at 
which the limiting is best and that, contrary to common belief, merely 
increasing the input level does not necessarily improve the limiting. This 
is also illustrated in Fig. 3. 

For these three reasons, the simple clipping picture does not give a 
suitable explanation of the limiting action. A more accurate description 









INPUT SIGNAL ~~ _ 
OUTPUT SIGNAL. 


—-—— CLIPPING 
LEVEL 


Fig. 4 — Input and output signals for a clipper limiter using diodes with finite 
back resistance. 
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is given by analyzing the circuit in terms of modulation theory. To 
justify the modulation theoretical approach, a simple argument can be 
given. 

A simple example of an Fm signal is a sine-wave modulated carrier 
with low index of modulation. If the index is sufficiently low, only the 
first pair of sidebands is important. The only difference between this 
FM Wave and an AM wave modulated with the same signal is in the phase 
relationships between the sidebands and the carrier frequency. An AM 
wave can be superimposed on the rm wave so that the am sidebands 
occur at exactly the same frequencies as do the rm sidebands. This 
situation is illustrated as follows: 


Am: (1 + k cos pt) cos wt = FE cos wt 


kE 


5 [cos (w + p)t + cos (w — p)él, 


+ 





FM: I cos (. + : sin pt) & FE cos ot (1) 
+ nS [cos (w + p)t — cos (w — p)él, 


pet “« i: 


where Aw is the peak deviation of the carrier 
p is the modulating frequency 
w is the carrier frequency 
k is the Am index of modulation. 
The composite FM signal with Am present is: 


Ei cos wt + pee [cos (w + p)é — cos (w — pil 
2p 

a (2) 

+ ks [cos (w + p)t + cos (w — p)dél, 


where the Am and Fr sidebands can be identified from (1). 

The purpose of the limiter is to eliminate only those sidebands repre- 
senting AM, i.e., the second pair in (2). Clearly this cannot be accom- 
plished by any linear circuit, since whatever happens to the am side- 
bands happens also to the Fm sidebands, which are at the same 
frequencies. The limiter must: accomplish two things. First, it must 
identify the am sidebands; second, it must eliminate them. And it must 
do these things without interfering with the rm sidebands. 

The identification can be made with an amplitude detector. Such a 
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detector does not interfere with an rm wave but does detect whatever 
AM is present. The output of the detector can then be used to generate 
a pair of sidebands in the output circuit similar to the original am but 
opposite in phase. If care is taken to adjust the amplitude properly, 
these remodulated sidebands will completely cancel the original side- 
bands, and the output signal will be free of am. It is important to note 
that neither the detection nor modulation affects the Fm signal except 
to introduce some loss. The output of the limiter contains only the 
desired FM signal. 

This argument, although heuristic, indicates three important results. 
It shows that limiting is necessarily a nonlinear process. More important, 
it indicates the possibility, at least theoretically, of perfect limiting using 
diodes with finite forward and reverse resistances and with a finite loss 
to the Fm signal. In addition, it points the way to a mathematical ap- 
proach using low-index modulation theory. 

The limiter configuration of Fig. 5, based on the principles discussed 
above, has been analyzed and the details are included in Appendix A. 


BAND DIODE | DIODE 2 BAND 
PASS PASS 
FILTER FILTER 


LOW BAND 
PASS PASS 


LOW FILTER FILTER LOW 


PASS PASS 
FILTER FILTER 





MESH 1 MESH 2 
DIODES ARE BIASED IN CONDUCTING DIRECTION (LOW RESISTANCE) 


Bane SHORT CIRCUIT AT CARRIER AND SIDEBAND FREQUENCIES 
FILTER| OPEN CIRCUIT AT ALL OTHER FREQUENCIES 


pee SHORT CIRCUIT AT BASEBAND FREQUENCIES AND DC 
FILTER| OPEN CIRCUIT AT ALL OTHER FREQUENCIES 


Cy, Ca AMPLITUDE OF CARRIER CURRENT IN MESH 1,2 
S;,S2 AMPLITUDE OF SIDEBAND CURRENT IN MESH 1,2 
B,,Ba AMPLITUDE OF BASEBAND CURRENT IN MESH 1,2 
I, , 1a AMPLITUDE OF DIRECT CURRENT IN MESH 1, 2 


é=E [cos wt + k/2[cos(wt+p)t + cos (w —p)t]] 


1S THE INPUT AM SIGNAL VOLTAGE WITH INDEX OF 
MODULATION kK AND MODULATING FREQUENCY )/277 


Fig. 5 — Limiter circuit with which perfect limiting can be obtained. 
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Rp = 10K OHMS 
Ry = 1K OHM 
Re = 4.5,5, 

6K OHMS 
B = 35K OHMS 
A = 20 OHMS 





V = 10.7 VOLTS 

















Sp 
k/2 Co 


























LIMITING IN DECIBELS, 26 L069] 


























2 
AVAILABLE INPUT POWER IN DBM, La 
L 


Fig. 6 — Calculated results for limiter of Fig. 5. 


The limiter of Fig. 5 is similar to that of Fig. 1(a), with three important 
differences. In Fig. 1(a), the impedance in branch a-B is infinite at all 
frequencies; in Fig. 5, this impedance is finite at all frequencies of inter- 
est. Also, the circuit in Fig. 5 is arranged so that the resistance R, , at 
frequencies in a band centered at the carrier frequency, can be controlled 
independently of the resistance Ry at baseband frequencies. (The diodes 
are assumed to have finite resistance.) In addition, only two bands of 
frequencies are allowed: a band including the carrier and sideband fre- 
quencies and another band extending from de up to the highest baseband 
frequency of interest. 
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The resistances R, and R, play a vital role in the operation of the 
limiter. By properly proportioning these resistances, the limiting can be 
made theoretically perfect at one or two input levels. If good diodes are 
used (diodes with a high ratio of reverse to forward resistance; most 
germanium and silicon diodes are good by this definition) perfect limit- 
ing can be achieved when Ry, = R,. From formulae derived in Appendix 


W.E.404B 
DIODES 





CARRIER FREQUENCY 1MC 
MODULATING FREQUENCY 1KC 
INDEX OF AMPLITUDE MODULATION=20% 





OUTPUT POWER IN DBM 

















LIMITING IN DECIBELS 























~16 S12 ~8 -4 ie} 4 8 
AVAILABLE INPUT POWER IN DBM 


Fig. 7 — Experimental results for practical limiter based on Fig. 5, with R, 
adjusted for one limiting peak. 
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A, several curves of limiting versus input carrier level have been calcu- 
lated and are plotted in Fig..6. For all the curves R, = 10,000 ohms, 
but F, is different for each curve... or all such data, limiting is defined 
as the ratio. of the Am.index. of ‘modulation of the output to the index of 
modulation of the input signal. In Fig. 6, the curve for R, = 4500 ohms 
shows two points of perfect limiting, and this behavior is verified in 
experimental data presented in Figs. 7 and 8. It is possible to get either 
one or two peaks, depending upon the value of R,. 

The theory indicates that, if two points of perfect limiting exist, the 
phase of the am sidebands reverses at the first point and again at the 
second. That this actually occurs has been verified by experiment. 

Figs. 9 and 10 show similar results obtained for the condition R, = 
R,.. Where possible, this is the preferred condition since it requires less 
by-passing of R, and R,, a problem which is especially important in 
broadband systems. 








OUTPUT POWER IN DBM 








LIMITING IN DECIBELS 




















—-16 =l2 -8 -4 ie} 4 8 
AVAILABLE INPUT POWER IN DBM 


Fig. 8 — Experimental results for practical limiter based on Fig. 5, with R, 
adjusted for two limiting peaks. 
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: : 
5 6 > 600 


Re = 22,5000 
V = 24v 
fog = 1100 KC 
INDEX OF MOD = 20% 
MODULATING FREQ = iKC 


























LIMITING IN DECIBELS 
OUTPUT POWER IN DBM 

















AVAILABLE INPUT POWER IN DBM 


Fig. 9 — Experimental results for practical limiter based on Fig. 5, with 
Ry = &, and adjusted for one limiting peak. 


Figs. 6 through 10 indicate that perfect limiting occurs only at iso- 
lated values of input power for this limiter. No practical difficulty arises, 
however, because perfect limiting can be approximated as closely as 
desired over any finite range of input power levels by cascading several 
stages similar to Fig. 5 and arranging the limiting peaks properly. In 
Fig. 6 the hump between the two peaks of perfect limiting can be re- 
duced to any desired level by increasing R,. This will have the effect 
of reducing the separation of the peaks. After choosing any finite amount 
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Fig. 10 — Experimental results for practical limiter based on Fig. 5, with 
R, = R,. and adjusted for two limiting peaks. 


of limiting, R, can be adjusted so that the limiting at the maximum 
point of the hump exceeds this value. (The minus sign is usually omitted 
and we say that 40 db of limiting is more than, or exceeds, 30 db, i.e., 
the decrease in AM index is greater for 40 db of limiting than for 30 db.) 
Then, for a continuous range of input levels, the limiting is more than 
the desired amount. Another stage can be adjusted so that its limiting 
peaks fall near but to the right of those of the first stage, in such a way 
that the limiting between the peaks always exceeds the predetermined 
number. Control of bias current and FR, will be required for this adjust- 
ment. If this process is continued it is evident that perfect limiting can 
be approximated as closely as desired over any finite range of input 
levels. 

In many practical situations, one stage is sufficient because the acc 
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FILTER 
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(a) EQUIVALENT CIRCUIT SHOWING COUPLING BETWEEN THE CARRIER 
AND OC FOR THE LIMITER OF FIG. 5 
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2 - a ae 


‘ 


‘ y¥ 













BAND 
PASS 
FILTER 

























LOW 
PASS 
FILTER 


LOW 
PASS 
FILTER 





(b) EQUIVALENT CIRCUIT SHOWING COUPLING BETWEEN SIDEBANDS 
AND BASEBAND FREQUENCIES FOR THE LIMITER OF FIG.5 


Yo=(B-A) A, +A Ro =(B-A) BatA 
Tn = 2(B-A) SINAAI7 Rn=2(B-A) SIN neat 


B= BACK RESISTANCE OF DIODE 
A= FORWARD RESISTANCE OF DIODE 


Fig. 11 — Equivalent circuits for limiter of Fig. 5. 
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of the iF amplifier acts to keep the carrier input level near the value for 
best limiting. 


III. EQUIVALENT CIRCUITS 


From equation (13) derived in Appendix A, equivalent circuits for 
the limiter of Fig. 5 can be drawn. These are shown in Fig. 11. Fig. 12 
is the dual of Fig. 5. In Fig. 11 all resistances with numbered subscripts 
are coefficients in the Fourier expansion of the resistance function of the 
diodes; lower case 7r’s indicate diode D;. Resistances with letter subscripts 


are as shown in Fig. 5(a). 
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OF MODULATION kK AND MODULATING FREQUENCY p/277 


Fig. 12 — Dual of limiter circuit shown in Fig. 5. 
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The limiting mechanism can be understood by consideration of the 
circuit involving Am sidebands [Fig. 11(b)]. Some Am sideband energy is 
coupled from the input mesh 1 to the output mesh 2 through FR, . Some 
of the remaining sideband energy is coupled to mesh 3 through the 
r;/2 where it appears at the baseband frequency. This baseband energy 
in mesh 8 is coupled in turn through —R,/2 into mesh 4, where it is 
still at the bascband frequency. From mesh 4 it is coupled through R,/2 
into the output mesh 2, where it appears at the sideband frequencies 
in phase opposition to the energy coupled through R,. By properly 
proportioning FR, and R,, the energies coupled by R, and —R,/2 can 
be made to cancel, leaving no sideband currents in the output mesh 
and therefore achieving perfect limiting. The rm signal behaves in the 
same manner as the carrier and suffers loss only in passing through the 
limiter. 


APPENDIX A 


The material in this section is based on Refs. 1 and 2. In Ref. 1 
Caruthers develops the superposition principle for low-index modulation; 
i.e., he shows that circuits containing nonlinear resistive elements can 
be resolved into equivalent linear systems using a large carrier and 
small signal amplitude. In Ref. 2 it is shown that a nonlinear resistance 
can be represented by a Fourier series, the coefficients of which have 
the dimensions of resistance. If the carrier is large compared to the sig- 
nal, this resistance is a function of carrier frequency and harmonics 
only and, to a good approximation, is independent of the signal. 

Based on these ideas, the limiter circuit can be solved by usual linear 
circuit techniques. 


A.1 Superposition 


Any nonlinear resistance has a current voltage characteristic that 
can be expressed by a power series of the form: 


e = af + at” + at’ + -+- + ag™ + +e: (3) 


If a carrier and signal current are applied to this nonlinear resistance, 
each term after the first will produce voltages at new frequencies con- 
sisting of the intermodulation products of the carrier and signal. If 
finite external impedances are presented to these voltages, currents will 
flow at these new frequencies and produce still more new frequencies. 
The number of currents can be minimized by presenting infinite im- 
pedance to some of the modulation products. 

If a carrier current C cos ct and two signal currents S; cos s¢ and 
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Se Cos Sot are applied to the nonlinear resistance the voltage due to the 
nth term of (8) is 


C, = A,(C cos ct + S; cos sit + Se cos sot)” 


n (4) 
a oe Oi (cos ct + et COS sit + * cos st) ' 


We now make the usual assumption of low-index modulation theory. 
This assumption is that S,/C and S,/C are small enough so that 
terms in the expansion of (4) containing products and powers of S,/C 
and S:2/C are negligible. The expansion of (4) is then: 


C, 2 A,zC” (cos ct + ncos” ct _ cos sf + n cos” ct * cos st) 3 5) 
The first term in (5) is the de and the harmonics of the carrier. The 
second and third terms contain the input signal, the output signal (usu- 
ally the first order sidebands) and other unwanted modulation products. 
The important thing to note about (5) is that the input and output 
voltages (second and third terms) are the same whether the input signals 
are applied separately and the outputs summed or whether the inputs 
are applied simultaneously. Therefore, the superposition principle holds 
and is a direct consequence of the fact that the carrier amplitude is 
much greater than the signal amplitude. 


A.2 Fourter Series Representation of Nonlinear Resistance 


The power series expansion for the nonlinear resistance as a function 
of current can be obtained by differentiating (8) with respect to 7. The 
nonlinear resistance is: 


rt) = . = 0, + Qat + Ba +--+ nAw t+, (6) 


where r(z) is the nonlinear resistance. 

The usual assumption made in low-index amplitude modulation theory 
and the one made here is that only the carrier current is significant in 
determining the nonlinear resistance. If the carrier current 1 = J cos ot 
is substituted into (6) and the series rearranged, the result is a Fourier 
series. Such a series can be written as 


r(t) = To + Dor, cos nwt. (7) 
- n=l 


If the current-voltage characteristics of the nonlinear element were 
expanded in terms of voltage instead of current, an analogous analysis 
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would lead to a representation of the element as a conductance: 
. et 
g(t) = go + Do gn Cos not. (8) 
n=1 


Either (7) or (8) can be used to represent a nonlinear resistance in low- 
index modulation problems. The accuracy of the representation depends 
upon the index of modulation; the lower the index, the better the repre- 
sentation. 


A.3 Limiter Analysis 


With superposition established and a simple expression for the non- 
linear resistance, the rest of the analysis can be done by usual linear 
circuit techniques. The procedure is as follows: 

i. An amplitude-modulated sine wave voltage is applied to the circuit 
of Fig. 5. No rm need be included, since it is not affected in any way by 
the limiter except to suffer some attenuation. 

ii. A solution is assumed; i.e., currents at certain frequencies are 
assumed to flow in the two meshes. The magnitudes of these currents 
are unknown and are to be determined. Currents at all other frequencies 
are not allowed to flow. 

iii. The mesh equations are set up and solved for pertinent currents. 

With reference to Fig. 5, the input voltage F is a carrier, amplitude- 
modulated with a sine wave: 


e= Ecosat + fH [cos (w + p)t + cos (w — p)il, (9) 


where k = index of modulation 
w/2nr = carrier frequency 
p/2x = modulating frequency. 
The currents that are assumed to flow are: 


1, = C, cos wt + S, [cos (w + p)é + cos (w — p)t] + Bicospi+h, 
dg = C2 cos wt + Se [cos (w + p)t + cos (w — p)t] + Bocospt+ Ir. 


The circuit of Fig. 5 is arranged so that no other currents are allowed 
to flow. With these currents the 7R drops across the nonlinear elements 
can be calculated. Let the diodes have the following expansions: 


(10) 


Diode Dy: 71(t) = 7 + > Tn COS Nut, 
n=1 
: (11) 
Diode be: ro(#) = Ro + >) (—1)"R, cos not. 
n=l 
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The factor (—1)” in re(é) is due to the fact that the diodes are poled in 
opposite directions. The voltage drops across the diodes are found by 
multiplying (10) and (11): 
: T2 
my = ari) = C1 (~ + =) COS wt 
rs = 5 ae * B,| [cos (w + pt + cos (w — p)t} 


+ (7B, + 781) cos pt + 5 C, + Liro + Tin cos ot, 


(12) 
Ve = ter(t) = Ce (® + sey cos wt 
Re Ry 
+ | Bos: ace ae | [cos (w + p)t + cos (w — p)e] 


NciGe Th Ga PO) eo pio “i Ca RR eoa.at 


The next step is to equate the voltage drops at each frequency to the 
generator emf’s in each mesh: 


iH Rai = RCo + Tyr, 
0 = —R.C, + RC, -— [2h 


| carrier 


—~V= 3 OA hiGee Rp STR: 





de 
V=—32C. + Lo + Rs) — LR, 
(13) 
ue = RnSi — ReSs + By 
sidebands 


0 — —RS, + RS am “ Be 


0 = Si + (% + Rv) By — RyBe ) 


| baseband, 
0 = —R,S, — RoBi + (Ro + Re) Bo 
where 


Rn = tot 5+ Ro + Bi 


Re = Ro+ + Re + Rs. 
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The set of equations (13) can be solved for the pertinent currents: 


C= 


E {Rl(r + R:)(Ro + Ro) — Rs'] — = (ro + re)} 





2 
7 (Raton SiG Ry ns) 











2 
D ? 
C, = 
E {Rellr + Ro)(Ro + Rs) — Re'] — * rakes) (14) 
2 
+V (Ratti + nRRo — oe 
Si {Ral + R»)(Ro + Ro) — Rs] — nie 
So = D ») 
where 


2 
= [(ro + Ra)(Ra + Ra) — Re'\(Rulee — RE) —  Realry + Ba) 


 R(Ry i Rs) + nhih-R, + (ny 6 


The insertion loss ratio is 











J gut = CYR: 
Yap i? (15) 
4Ry 
and the limiting, or amplitude modulation suppression, is 
a eee 
limiting = eS (16) 
5 


Expression (16) shows that perfect limiting occurs when S. = 0. The 
numerator of S. as given in (14) indicates the possibility of perfect 
limiting if R, and Ry, can be adjusted independently. 

At this point it is convenient to place some further restrictions on the 
diode characteristics so that the calculations.can be made more easily. 
The diodes are assumed to switch from a constant forward resistance A 
to a constant backward resistance B. The fraction of the carrier cycle 
that the diodes remain on the back resistance is designated 81, Be. 
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With these simplifications, the diode characteristics given in (3) are: 


r(t) = ro + >> rp COS Nut 
n=1 
; (17) 
ro(t) = Ro + 2, (—1)"R, cos nut, 
n=1 


where 
= (B — A)f&i + A, Ryo = (B — A) + A, 


‘= 2(B — A) sin nBiT, R,, = _ 2(B — A) 


sin nBer 
nT nT , 


B = back resistance of the diode, 

A = forward resistance of the diode, 

8 = the fraction of the cycle during which the diode is on its 
back resistance. 

All of the currents in (14) are functions of Z, 8, and 82, where F is the 
amplitude of the input carrier voltage. In order to make calculations 
for (14), relations among LF, 8; and 8 must be derived. 

When the input to the limiter is very low, both diodes are on their 
forward resistances, due to the forward bias current Jo. At these low 
levels 8; , G2 are both zero and, from (14), 

C2 R. 
GO. Ra By ) 

Because the circuit is resistive and because only the carrier current 
is involved in switching the diodes, (18) holds for all instantaneous loop 
currents less than J). From the definition of 8, the diodes switch to 
their back resistances when 


C*; cos (—£i7) => Io, C; = Io, 
C, cos (— Bor) = Io, Cs = ie 


At the switching point, both (18) and (19) hold. Therefore, by substitut- 
ing (18) into (19), a relation between 6; , @ follows: 


V 


(19) 


cos Bir = COS Bor. (20) 


R.+ R, + hy 
In a similar manner, a relation can be derived for EL, 6; . When C; = I, 
61 = B. = 0 and, from (14), 


(RnRz — Re) 


miles R,z Gams 


(21) 
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Now, for any EF = EE), diode pb, switches to its back resistance when 
[Fades a Teal 
Rez (81 = B2 = 0), 


where Jo is the bias current in each diode and is given by (13) for Bp, = 
Bo = 0. Thus, 


E cos Byr => 7D => Io (22) 


V 


tO 5p ea? 


(23) 
where 
V = de bias voltage 
A = forward resistance of diode. 
Curves of power output and amplitude modulation suppression have 
been calculated and are shown in Fig. 6. 


A.4 Circuit Simplification 


A natural question to ask about the limiter is, “Can perfect limiting 
be obtained when R, = R, ?” If so, the circuit is much simpler. The 
answer to this question is yes, and this can be demonstrated by con- 
sidering the numerator of S.. Perfect limiting is achieved when this 
numerator is zero. Setting the numerator of S.2 equal to zero and letting 
R, = R,, the resulting expression is solved for R, : 





Ry 1 
Ro ares (ts. 5) . (24) 
3 To + Ro : 
where 
hi, 2 0. (25) 


Using this fact in (24), it follows that perfect limiting can be obtained 
when fi, = R, if the following inequality is satisfied: 


di Pd 

279 2Ro ~ 2° 

Let us examine this further by using the idealized diode character- 
istics of (17). Substituting these expressions into (26), we get 


(= Br) 1 (= bn) 1 ae (27) 
Bir (1 4 A ) Bow (1 ee A \o 2° 
2(B — A)B; 


This inequality can be met easily with good diodes, i.e., diodes in which 
B/A> 1. 


(26) 
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Figs. 9 and 10 show a limiter and experimental results for the case 
R, = R,. It should be noted that, even with poor diodes, the limiting 
can be perfect if 2, and FR, are allowed to differ. This can be seen in 
the expression for S2. 


A.5 The “Clipper” Limiter 


A well-known limiter has the configuration of Fig. 1. The behavior 
of such a “clipper” limiter can be examined by setting R, = R, and 
letting R, and V increase without bound in such a way as to maintain 
the initial bias current J) unchanged. This will also insure that 8; = Be 
andr; = R;. 

If these exercises are performed on (14) and the results substituted 
into (16), the limiting is found to be 


limiting = Wh oie (28) 
where 
. i = carrier voltage amplitude, 
Io = constant current de bias, 
r, = [2(B — A)/z] sin Bir. 


Several remarks can be made concerning the “clipper limiter’ of (28): 

i. Before the diodes begin to switch, r, = 0 and there is no limiting. 
This is to be expected. 

ii. If J) = O there is no limiting. This too is to be expected because, 
with no bias, one diode has its back resistance in the circuit while the 
other one has its forward resistance in the circuit, and vice versa. The 
circuit is linear under these conditions and cannot limit. 

iii. Expression (28) indicates that the larger r;, the better will be 
the limiting for a given J). But 7, is proportional to (B — A), so this 
indicates a good diode for a limiter of this type is one with a high back 
resistance B and a high ratio of back-to-front resistance. 

iv. If the diodes have a finite back resistance, perfect limiting cannot 
be achieved. Furthermore, as 6; approaches 1/2, the limiting tends to 
zero. The reason for this is that 7; is proportional to sin 6,7, which tends 
to a constant value as 6B, approaches 1/2 while # increases without limit. 

The output carrier current can be calculated in the same way as (28). 
It is 

EK + Qn 0 


OF oe oR ee 
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Comments can also be made concerning this equation: 

i. At low levels of input voltage the diodes do not switch and 7, = 
ro = 0, and ro = A. The output is then C, = H/[2(R; + A)], as ex- 
pected. 

ii. If I, = 0, then 6, = Bo = 1/2, resulting in) = (B + A)/2 and 
ro = O. In this case C, = E/(B + A + 2R)) and the circuit is linear. 

In the above discussion of the ‘clipper’ limiter, all harmonics of the 
carrier were suppressed. Two other cases have been considered: in one 
case, the third harmonic was allowed and, in the other, the third and 
fifth harmonics were allowed (even harmonics cancel). The analysis 
follows the same pattern as previously demonstrated except that cur- 
rents at all allowed frequencies are assumed. Only the results will be 
shown. 

When the third harmonic is allowed, limiting is given by: 


1 


1 Cos Br | r3(r2 + 14) | (30) 
Qe eS 
R,+A 2R, + 2ro + re 


Similarly, when the fifth harmonic is added, 


L= 


cos Br 


Boca {ry — rsl(re + 74)(2R, + 2ro + rio) 





L= (1 + 
— (re + 7s)(rs + 76)] (31) 
= rsl(re as ra)(12 Fs 1's) (4 + re)(2Ri + 279 + rh) 


Equations (80) and (31) are plotted in Fig. 3. The curve for all harmonics 
was calculated by a Fourier analysis of the “clipped” wave. 


APPENDIX B 


In this Appendix it will be shown that the amount of limiting is not 
necessarily given by the slope of the carrier transfer function. 

Assume that the limiter of Fig. 5 is adjusted for perfect limiting for 
the carrier amplitude H = £,. In the equivalent circuits of Fig. 11, it 
is noted that the output carrier current C, depends upon the de coupling 
through R,. Note further that the sideband current S, depends upon 
the baseband coupling through Ft, . 

By proper filtering, the de and baseband currents can be made to 
flow through separate resistors without changing the behavior of the 
limiting or carrier transfer functions. With respect to FR, , the equivalent 
circuits of Fig. 11 are then completely independent. If the baseband 
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resistance in Fig. 11(b) is changed, S:2 will clearly change. But the car- 
rier transfer function is undisturbed, since no change was made in Fig. 
11(a). Therefore, the limiting is not completely determined by the slope 
of the carrier transfer function. 

This behavior has been verified experimentally using a limiter similar 
to that of Fig. 7. 
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Oxides of the 3d Transition Metals’ 


By F. J. MORIN 
(Manuscript received April 16, 1958) 


The magnetic, electrical and optical properties of the 3d metal oxides are 
examined for clues to the energy band structure. A tentative energy band 
scheme is proposed for the oxides of scandium, titanium and vanadium 
which suggests that the nonbonding 8d orbitals of neighboring cations overlap 
sufficiently to form a 3d conduction band. The 3d orbitals do not overlap in 
the oxides of the remaining 3d metals, with the consequence that the 3d 
electrons are in isolated energy states and electron transport occurs by electron 
exchange between cation neighbors and involves an activation energy. An 
energy level scheme for these latter oxides 1s calculated from simple electro- 
statics, and it is shown to agree with the observed electrical properties. Con- 
duction in these oxides includes transport of high-mobility holes in the 2p 
band of the oxygen lattice and of low-mobility holes and electrons in the 3d 
levels of the cation lattice. 


I. INTRODUCTION 


The transition metals combine with other elements to form a host of 
compounds which are electronic conductors and which range in chemical 
bonding from ionic (oxides), through covalent (sulfides, arsenides) to 
metallic (carbides, nitrides). This range in valence character promises a 
great variety of energy band structures and transport processes and offers 
a rich field for theoretical and experimental investigation. No detailed 
determination of band structure or transport has been made for any of 
these compounds, and so they constitute an enormous gap in our knowl- 
edge of the solid state. Indeed, except for the 3d oxides, the semicon- 
ductor properties of transition metal compounds are largely unknown. 
In recent years the use of these oxides in electric! and magnetic? circuit 
elements-and in heterogeneous catalysis,’ as well as their importance in 
corrosion’ have emphasized the necessity for a fundamental understand- 
ing of their properties. Now, from our point of view, the most funda- 
mental aspect of a solid is its energy band structure. Therefore, it will 


* This article is a chapter from Semiconductors, edited by N. B. Hannay, to be 
published by Reinhold Publishing Corp., New York, as part of the American 
Chemical Societ y Monograph Series. 
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be the concern of this paper to provide-a tentative energy band scheme 
for the 3d oxides. The discussion will be restricted to the simple oxides 
and avoid the more complicated situations to be found in the oxide 
compounds, such as garnets, spinels and perovskites. Because of the 
scarcity of reliable experimental data on single-crystal material, the 
proposed scheme can only be regarded as tentative. At the same time, 
it provides a useful frame of reference for the planning and interpretation 
of future experiments. 

We can begin by considering a relatively simple oxide, for example, 
zine oxide, which is composed of ions having a closed-shell electronic 
configuration. The energy bands in zinc oxide arise, to a first approxima- 
tion, from the filled 2p levels of the O= and the empty 4s levels of the 
Zn*+ which are broadened when the ions are brought together to form 
the solid. The filled 2p band is separated from the empty 4s band by a 
forbidden energy region. Thus, zinc oxide is a semiconductor. In the 
same way, a 2p band and a 4s band are expected to arise in the 3d oxides. 
However, it is not easy to see what to expect from the partially filled 3d 
levels of the cations, and we must attempt to answer such questions as (i) 
whether the 3d wave functions overlap sufficiently to form a 3d band 
in any of the oxides or if they do not overlap, so that the 3d levels are 
localized energy levels, and (ii) the location in energy relative to the 3d 
levels (or band) of the 2p band of the anion lattice and the 4s band of 
the cation lattice. 

To learn something about the properties of a 3d band it is instructive 
to consider the energy band situation in the 3d metals. nergy band 
calculations have been made for some of these metals, using the cellular 
and the tight binding methods.® Usually it is found that the 4s band is at 
least 10 ev wide and overlaps the 3d band, which in nickel, for example, . 
is about 2.8 ev wide. On the other hand, the 3d band can hold ten 
electrons per atom, while the 4s band can hold only two electrons. This 
difference in numbers accentuates the difference in bandwidth: per 
electron, the 3d band in Ni is only 75 as wide as the 4s band. The narrow- 
ness of the band means a high effective mass and low mobility for charge 
carriers in the band. In the metals, the electric current is carried largely 
by electrons in the 4s band, but these have their mobility decreased by 
transitions to the 3d band. In the 3d oxides, the increased internuclear 
distance can be expected to reduce the 3d bandwidth still further, be- 
cause the overlap of the 3d wave functions will be greatly reduced. This 
will result in a further increase in effective mass and lowering of mobility. 
Later in this paper it is shown that the 4s band probably does not overlap 
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the 3d band in most of the oxides, so that electronic conduction can be 
entirely within the 8d band and thus involve carriers of very high ef- 
fective mass and very low mobility. On the other hand, transport in a 
4s or 2p band, should it occur, will probably be evidenced by the rela- 
tively high mobility of low-mass carriers in a wide band. Results of soft 
X-ray emission indicate the 2p band in oxides to be 10 to 20 ev wide.® 

When the 8d band becomes extremely narrow it is no longer meaning- 
ful to assign a width to the band, and the 3d charge carriers can be con- 
sidered to occupy energy levels localized on the cations. It was first 
pointed out by de Boer and Verwey’ and by Mott’ that this situation 
must exist in many oxides such as CreO03, Mn2O;3, Fe203, CoO, NiO 
and CuO which are insulators when pure and stoichiometric and have 
room-temperature resistivities exceeding 10'° ohm-cm. In order that 
conduction in the 3d levels can occur there must be present, in pure 
NiO for example, some Ni*** and Nit ions (electron hole pair) which 
can move about by exchanging electrons with the normal Nit* of the 
lattice. These excited states, however, can only exist at high tempera- 
tures. Nickel oxide can be made conducting at room temperature by 
introducing Nit** ions into the lattice, either by a departure 
from stoichiometry (vacant Ni++ sites) or by the substitution of Lit 
ions for Nit ions. Wandering of the Nit*+* about the Nit lattice by 
electron exchange is possible even though the cation separation of 3A is 
twice the cation diameter. Transport of this type is expected to involve 
an activation energy associated with self-trapping of the charge carriers. 
Mobility will be extremely low and increase exponentially with tempera- 
ture. Such behavior is distinctly different from transport in a band where 
mobility decreases with increasing temperature because of lattice scatter- 
ing. Semiconductors in which transport is by electron transfer do not 
show many of the effects which have been used for study of the band 
structure and transport in semiconductors like silicon or germanium. 
For example, no Hall effect, photoconductivity or carrier injection is 
observable, so that transport measurements depend primarily upon 
Seebeck effect and electrical conductivity. 

The oxides series to be considered in this paper is the following: 








TiO(d) FeQ(d’) | CoO(d?)|NiO(d*)|CuO(d*) 
Seo02(d°) | Tiz03(d") | Ve Os(d?)|Crz Os(d)|Muny O3(d!) | Fez Os(d°) 
Ti0.(d°) 








The electronic configuration of the 3d shell is indicated. The 4s shell is 
assumed to be empty in all cases. The useful data concerning these oxides 
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are scant and most of them have been taken on ill-defined polycrystalline 
samples. However, from these data we shall try to derive a general 
energy band scheme for the 3d oxides. Particular attention will be paid to 
transport and optical absorption, because these properties give an 
indication of 3d bandwidth. Also, because the majority of the 3d oxides 
are antiferromagnets, any correlation between their magnetic and elec- 
tric properties will be considered. In Section II a brief description of 
their magnetic properties is given. In Section III crystal field theory is 
discussed in a simple way and used to explain the available absorption 
spectra of the 3d oxides. In Section IV it is shown to be probable that a 
3d band having appreciable width exists in the oxides of scandium, ti- 
tanium and vanadium, whereas in the remaining oxides the 3d wave 
functions appear to be localized on the cations. In Section V the relative 
energies of the 4s, 3d and 2p levels are calculated. This energy level 
scheme is shown to fit fairly well the experimental results on NiO (Sec- 
tion VI) and Fe.03 (Section VII). Finally, in Section VIII, a few specu- 
lations are made concerning transport in the localized 3d levels. 


II, ANTIFERROMAGNETISM® 


The transition metal oxides are antiferromagnetic, a state character- 
ized by an ordered antiparallel arrangement of electron spins. Since the 
electronic structure of a solid underlies its magnetic as well as its semi- 
conducting properties, it is not surprising that experiments on oxides 
have shown a connection between these two properties. Two ways of 
explaining magnetism have been used—the energy band approach for 
transition metals, and the ionic approach for transition metal oxides. 
In transition metals, the magnetic atoms lie close together and there is 
an interaction between the spin moments of neighbors. This exchange 
interaction depends upon the overlap of atomic orbitals. For atoms hav- 
ing a partially filled 3d shell the interaction may be either negative (anti- 
parallel spins) or positive (parallel spins), depending upon interatomic 
distance, and changing from negative to positive with increasing dis- 
tance. It is possible to divide an antiferromagnetic metal crystal into 
two magnetic sublattices, with each atom having nearest neighbors of 
opposite spins. The situation is different in transition metal oxides. In 
these, the magnetic cations are separated by anions. An example of this 
is MnO, whose sodium chloride structure is shown in Fig. 1. It is impos- 
sible for atom A’ to have a spin direction antiparallel to that of all of its 
nearest neighbors, since one half of these B’, C’ and D’ neighbors must 
have parallel spins to give an equal number of up and down spins in the 
lattice. There is one spin ordering, however, in which A’ is antiparallel 


OXIDES OF THE 3d TRANSITION METALS 1051 





Fig. 1 — The sodium chloride structure and antiferromagnetic spin alignment 
in MnO. Spins in a (111) plane are parallel, with the (111) planes coupled anti- 
parallel to each other. (After Gorter, Ref. 2.) 


to all of its next nearest neighbors; this is shown in Fig. 1, and is the 
one actually found by neutron diffraction. 

But now the antiparallel magnetic ions are separated by anions and 
the problem is to explain how exchange interaction can occur when 
electron orbits do not overlap. One way of approaching the problem 
was suggested by Slater.’ Because of the exclusion principle, electrons of 
like spin will tend not to occupy the region between anion and cation. 
That is, electrons with + spin on a cation will repel electrons with + 
spin on the neighboring anion somewhat more than electrons with — 
-spin. This repulsion will deform the outer electronic shell of the anion. 
In Fig. 1 it can be seen that anion X, for example, has three Mntt 
neighbors (C’, B, D) in a (111) plane (dotted diagonals) and having + 
spin. Anion electrons with -++ spin will tend to collect on the side of the 
anion opposite from this plane of cations. This tendency will be aided if 
the (111) plane (solid diagonals) of cations (C, D’, B’) on the opposite 
side of X have — spins, and it may be seen in Fig. 1 that this is the 
case. Thus, the antiferromagnetic alignment between spins in the (111) 
planes represents a lower energy state than does the ferromagnetic 
alignment. A different quantitative description of the coupling between 
electron spins of two cations separated by an anion has been given by 
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Anderson!® and by Yamashita and Kondo." It is called superexchange. 
Since the p orbitals of the anion are directed 180° apart, exchange is 
found to be strongest when cations are on opposite sides of the anion 
and weakest when they are at 90° from the anion. 

At high temperature, thermal vibrations prevent the antiparallel 
ordering of spins, and the material is paramagnetic. The transition 
temperature separating the paramagnetic and antiferromagnetic states 
is called the Néel temperature, 7'y. A plot of Ty as a function of the 
number of spins per cation is given in Fig. 2 for the oxides which are of 
interest in this chapter. The transition at 7'y is characterized by the 
occurrence of maxima in specific heat, magnetic susceptibility and the 
expansion coefficient. Electron spin resonance disappears below T'y . 
The direction of spin alignment can be determined by neutron diffraction 
and magnetic susceptibility, using single crystals. 

In the rhombohedral sesquioxides TizO;, V2O3, CreO3; and Te.0; 
the cations are in pairs of closely spaced parallel layers, each pair of 
layers separated from the next pair by a layer of oxygen ions. The spatial 
arrangement of the ions projected onto a plane is shown in Fig. 3. Cation 
layers are represented by solid horizontal lines and anion layers by 
dotted horizontal lines. The heavy lines show how four cations are 
coordinated with each anion. Each cation participates in 18 cation- 
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Fig. 2 — Plot of Néel temperature, Ty, against the number of spins per cat- 
ion for some of the 3d oxides discussed in this paper. 
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Fig. 3 — A projection onto a plane of the spatial arrangement of a group of 
ions in the rhombohedral sesquioxides. Cation layers are represented by solid 
lines and anion layers by dotted horizontal lines. The spin orientations of the 
cation layers are shown for Fe.O; and Cr.O;. The cation-anion-cation paths of 
superexchange are indicated by the heavy lines which do not include a 90° angle. 
(After Gilleo, Ref. 35.) 


anion-cation configurations, one-third of which are shown in Fig. 2. 
Because half of the configurations include an angle of about 90°, only 
nine configurations are important in exchange interactions. The spin 
alignment shown in Fig. 3 was found by neutron diffraction!’ and by 
magnetic susceptibility.!? Spin alignment has not been determined in 
Ti:O3 and V203 ‘: 

A second magnetic transformation is possible in the rhombohedral 
sesquioxides; that is, a change in the direction of spin alignment from 
perpendicular to the three-fold axis, as in Fig. 3, to parallel to this axis. 
According to neutron diffraction,’ magnetic susceptibility’ and spin 


1054 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 


resonance,!® this transformation occurs between —30°C and —15°C in 
Fe.03 : 

Another interesting property of Fe:O; is its weak ferromagnetism,!® 
which persists up to 7’y and has been explained on the basis of sym- 
metry considerations.” The internal magnetic field due to the ferro- 
magnetism produces an anomalous Hall effect,!® which depends upon 
the magnetization of the material and disappears above Ty . This Hall 
effect probably can be understood on the basis of the theory!® for a 
similar effect found in the transition metals. It cannot be used as a 
measure of carrier concentration in the usual way. 

Exchange interaction, besides producing spin alignment, also tends 
to move the ions in such a way as to increase the exchange, and thus it 
causes lattice deformation. The net number of antiparallel neighbors 
increases with decreasing temperature because the opposing effect of 
thermal vibration decreases; correspondingly, lattice deformation 
increases with decreasing temperature. At temperatures below Ty , the 
exchange coupling between the (111) planes of spin cause rhombohedral 
distortion in MnO, FeO and NiO and tetragonal distortion in CoO.?° 
Lattice distortion below T'y is also found in the rhombohedral sesquiox- 
ides. The exchange coupling between the cation layers contracts the 
lattice along the trigonal axis! but does not change the symmetry. 


III. ATOMIC ORBITALS AND THE CRYSTAL FIELD” 


If the 38d wave functions in an oxide do not overlap to form an energy 
band but remain concentrated near the cation, the cation, in effect, is 
isolated. It experiences, to a first approximation, only the electrostatic 
field due to the surrounding oxygens. Such a situation exists in the 
hydrated salts of the transition metals, for example the complex 
Ni(H,0),+*, where the cation is surrounded octahedrally by six H.O 
molecules with the oxygens (the negative end of the water dipole) nearest 
the cation. Similarly, in NiO, which is a face-centered cubic structure, 
the negative oxygens are also arranged in a regular octahedron about 
the Nit. In fact, the optical absorption spectrum for NiO is very similar 
to that for the hydrated salt. Therefore, these considerations suggest 
that the same theories might be applied to the 3d oxides as have been 
used in recent years to explain the optical properties of complex salts of 
3d ions. Conversely, this implies that, if a given oxide spectrum can be 
understood in this way, the 3d wave functions in that oxide are con- 
centrated near the cation and do not overlap to form a band. Accordingly, | 
we will now see what available spectra indicate as to the possible ex- 
istence of 3d bands. 
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Three theories have been used to account for the properties of the 
complexes: 


1. molecular orbital,?° 
2. valence bond,” 
3. crystal field. 


The advantage of the crystal field theory is that it can give information 
in a simple way about the excited states of the cation. Thus, it furnishes 
an explanation of the absorption spectra and promises to give insight 
into the electric and magnetic properties of the oxides. 

The crystal field theory assumes a purely ionic model and considers 
the effect of the octahedrally coordinated negative anions on the 3d 
electrons of the central cation. The geometry of the cation and its 
surroundings is shown in Fig. 4. Suppose, at first, that the central ion 
has a single 3d electron with the quantum numbers / = 2, s = 34. There 
are five (21 + 1) d orbits, all having the same energy in the isolated ion. 
The associated wave functions can be expressed as a function of radius, 
r, multiplied by functions with angular variation of the form zy/??, 
y2/1?, zx/r®, (a — y?)/r? and (222 — 2? — y*)/r?. These orbitals can be 
represented by the charge cloud distributions shown in Fig. 5. The zy, 
yz and zx orbitals are designated as de and the other orbitals as dy. 
When negative anions are on the 2, y, z axes, the negative charge clouds 
of the electrons in the dy orbitals, which are directed at the anions, are 









@ CATION 
@ ANION 


Fig. 4— A regular octahedron of anions surrounding a central cation. 
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Fig. 5 — Angular distribution of the five 3d orbitals. 


repulsed by the charge cloud of the anion. Thus, the dy orbitals have a 
higher electrostatic energy than the de orbitals, which are directed be- 
tween the anions, and the ground state of the central ion is split as shown 
by the energy level diagram Fig. 6. This diagram is for an anion ar- 
rangement having cubic symmetry, which occurs in the rock salt oxides, 
and trigonal symmetry, which occurs in the rhombohedral sesquioxides. 
The trigonal case, since it involves a regular octahedron of anions, 
yields nearly the same excited states as does a cubic field, with an addi- 
tional broadening of the absorption bands due to the small splitting of 
the de levels shown. 


‘d x 


[xy | yz | zx [xe-ve] 222 ENERGY 


FREE ION 
ey: UZ | -2e 
eae 
CUBIC TRIGONAL 
FIELD FIELD 


Fig. 6 — The ground state configuration of the five 3d orbitals: when in the 
free ion; in a crystal field of cubic symmetry (as in MnO, FeO, CoO and NiO); 
in a crystal field of trigonal symmetry (as in the rhombohedral sesquioxides). 
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An important rule concerning the splitting of these levels is that 
their centers of gravity in energy are not altered by the perturbation. 
That is, 


3E, + 2K, = 0. (1) 


Fixing the energy scale to be zero at the ground state of the free ion 
and defining 10 Dq as the energy difference between the two levels 
which a single 3d electron can occupy when subjected to a cubic field 
gives, from (1) 


hi, = —4Dq, (2) 
Ey = +6Dgq. (3) 


The relative positions of the excited states above the ground state, as 
determined for ions with electron configurations d! to d® subjected to a 
cubic field, are shown in Fig. 7. To the left in each diagram are shown 
the free ion ground states, labeled with their term symbols. In diagrams 
(c) and (d) are also shown the first free ion excited states having the 
same spin multiplicity as the ground states, and in (b) two excited 
levels for the free ion are depicted. To the right in each diagram are shown 
the ground state and excited states for the ions in the solid, as produced 
by a cubic crystal field of strength Dq. Configuration d*, shown in (b), is 
plotted in a different way (after Orgel?) from the other configurations, 
because its excited states but not the ground state are split. In these 
diagrams, the first excited state of the ion corresponds to the excitation 
of an electron from the de to the dy levels. For example, in NiO, con- 
figuration d, the first excited state of the nickel ion is due to the electronic 
transition represented by the change in configuration d*’e d?y — d'e dy, 
which leaves a hole in the normally filled de levels. The energy of these 
two states can be calculated as follows. The ground state, d%e d?y, is 
located at (—4Dq X 6 + 6Dq X 2) = —12Dgq. The excited state, 
d’« d*y, is located at (—Dq X 5 + 6Dq X 3) = —2Dq. These energy 
levels are shown in Fig. 7(d). These transitions became allowed as a 
result of lattice vibrations?® and hemihedral distortion.” 

The available absorption spectra for some of the oxides are shown in 
Fig. 8, together with the location of the absorption maxima of a hy- 
drated salt of the cation in aqueous solution. The close correspondence 
between maxima found in hydrated complex and oxide for Fet*t+ and 
Nit indicates that the cation is isolated in the oxide as it is in the 
hydrated salt, and suggests that the 38d wave functions do not overlap 
in the oxides of the metals iron and nickel. The spectra of Cr.O03, how- 
ever, show a slight shift to frequencies lower than that of the hydrated 
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Fig. 7 — Positions of some excited states above the ground state for ions with 
electron configurations d! to d°. In each diagram the free ion is to the'left and 
labeled with appropriate term symbols, the ion in a cubic field is to the right. 


salt, indicating a slight overlap of the 3d wave functions. This shift 
becomes very large in the spectra of Ti,03, probably indicating that the 
cations in this oxide are not isolated as in their hydrated salts, and 
may be due to the overlap of 3d orbitals, with the formation of a 3d band. 

From the spectra, the energy level diagrams of Fig. 7, and the values 
for the free ion excitation energies, X (from Ref. 28), the information 
given in Table I is obtained. The observed frequencies in the oxides, 
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Fig. 8 — Absorption spectra for TizO3 , CreO3 , FezO; and NiO compared to the 
location of absorption bands for hydrated salts of the corresponding cations. 
(Ti203 spectra after Pearson, Ref. 32; Cr.O3 spectra after D. S. McClure, unpub- 
lished; hydrated salt spectra after Holmes and McClure, Ref. 48.) 


shown in Column 3, were used to calculate the values of Dg in Column 4. 
In turn, these were used to obtain the calculated frequencies in the final 
two columns. 

Another bit of information can be obtained from the absorption 
spectrum of Fe.O; containing titanium, Fig. 8, which shows an addi- 
tional absorption band appearing at 6100 cm-! when one per cent ti- 
tanium is added substitutionally. Titanium is a donor in FeO; and, in 
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TABLE IJ — SpEcTRA OF OXIDES AND HyDRATED SALTS 











Type of Frequencies 
Ton Comparison 

of Spectra v1(cm=) Da(cm=) y2(cm=1) v3(cm-!) 

Tit exp. 6000 — — — 

calc. — 600 — — 

aq. 20300 — — — 

Vttt aq. 17800 — 25700 — 
Crttt exp. 16800 — 21300 > 27000 
calc. — 1680 24000 380000 

aq. 17600 — 24700 —- 

Fettt exp. 12200 — > 15500 — 
aq. 12600 — 18200 24500 
Nitt exp. 8600 — 15400 > 22000 
calc. — 860 15500 26000 
aq. 8600 — 14700 25500 














Comparison of spectra of oxides: exp. (measured); calc. (calculated from oxide 
data and diagrams Fig. 7); aq. (measured on hydrated salts). Data for the hy- 
drated salts were taken from O. G. Holmes, and D. 8. McClure.*? 

Note: 1 ev = 8100 cm™ 


this concentration, it is nearly all ionized to give Tit*** and Fe** ions. 
This absorption band, therefore, is probably due to the Fet* ions, so 
that, from this information, Dg is 610 cm for the Fet+ environment 
in Fe,03;. Now the value of Dq for Fet* in Fe(H.O).t* is 1000 cm, 
which can also be taken to be the value of Dg in FeO. The smaller value 
found for the Dq of Fet* in Fe.O; suggests that the anions surrounding 
the Fet+ ion are at a greater distance from the Fe+t ion in Fe2O3 than 
in FeO. This is just what one would expect to find if the conduction 
electron (Fet++) in FeO; polarized (repulsive polarization) the surround- 
ing lattice. 


IV. THE 3d BAND 


The inner electronic configuration of the 3d metals is that of argon. 
As electrons are added to the argon core to build up the series, the nuclear 
charge is also increased. The electron cloud is thus subjected to an 
increasing nuclear charge but, on the other hand, the mutual repulsion 
of the electrons increases. The over-all decrease in ionic radius going 
from Catt+(r = 0.99A) to Zntt(r = 0.834.) indicates that nuclear 
attraction predominates over mutual repulsion, and the electron cloud 
becomes contracted. Therefore, if 8d wave functions overlap sufficiently 
to form a band in any 3d oxides, such band formation is to be expected 
at the beginning of the 3d series. Since, in fact, oxides such as Cr2Q3 , 
Fe,O3 and NiO normally are insulators, it is in the oxides of Sc, Ti and V 
that a 3d band might be expected. In the previous section it was sug- 
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Fig. 9 — The rapid decrease in ionic radius with increasing atomic number for 
the fant few members of the 3d series. This implies a contraction in the 3d wave 
functions and, therefore, a decrease in the amount by which they may overlap to 
form a 3d band. (Ionic radii from Goldschmidt, Ref. 44). 


gested, on the basis of optical absorption, that there was evidence that 
such might be the case in Ti,O3. A comparison of the ionic radii of the 
trivalent cations plotted in Fig. 9 shows how rapidly the radius (and 
therefore the extent of the 3d wave functions) decreases with increasing 
atomic number in the first three members of the series. The abrupt 
increase in the radius of Mnt+* does not contradict this idea, since it is 
due to the addition of the first dy electron. This, since it interacts with 
the electron cloud of the anion, increases the effective radius of the 
cation.?® Since no transport information exists for Sc.03, the data for 
the oxides of Ti and V will be examined for evidence of a 3d band. 


4.1 T7102 (Configuration d°) 


In TiO, , the electronic configuration of Tit**t* is 3d°, so that if a 3d 
band exists we expect it to be normally empty. We have, however, the 
possibilities of supplying electrons to the 3d band by excitation from 
imperfections or from the filled 2p band. Hall mobility of conduction 
electrons has been measured by Breckenridge and Hosler*® in ceramic and 
single-crystal samples of TiO. (rutile). The electrons were thermally 
excited from lattice defects introduced by slight reduction of the samples. 
Room-temperature mobility ranged from 0.1 to 1.0 em?/volt-sec for all 
samples, the crystals tending to have’ the higher values. The mobility 
decreased in a normal way with increasing temperature. This mobility 
is 100 to 1000 times smaller than the electron mobility found in ZnO, 
and is precisely what is expected from transport in a 3d band. 
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Fig. 10 — Photoconductivity in stoichiometric single crystal rutile. (After 
Cronemeyer, Ref. 31.) 


Conductivity, photoconductivity and optical absorption have been 
measured by Cronemeyer*® on pure stoichiometric single crystals of 
rutile. These data indicate an intrinsic forbidden energy gap of 3.05 ev. 
Above 1200°K, however, the conductivity temperature dependence 
suggests an increase in the energy gap to 3.67 ev. Presumably, the 3.05 
ev represents an electronic transition from the filled 2p band of the oxygen 
lattice to the empty 3d band, which was indicated by Hall mobility to 
be the conduction band. The increase in the energy gap by as much as 
0.6 ev at high temperature, suggests two possibilities: either the 4s band 
of Tit++ is near the upper edge of the 3d band, and transitions to that 
band are also important, or the 3d band is split into two sub-bands. 
Discussion in Section V shows that the 4s band is probably too high in 
energy to play a role in conduction in the oxides. The second alternative 
—splitting of the 3d band into de and dy sub-bands—seems to be the 
most likely. 

The photoconductivity data, Fig. 10, only extend to 3.4 ev, pre- 
sumably not high enough in energy to show the peak of the dy band. 
At room temperature, photoconductivity shows a single absorption 
band having a maximum at 3.0 ev, probably the de band, and a broad 
shoulder at higher energy which may be the beginning of the dy band. 
The width of the main absorption band seems to be a few tenths of an 
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electron volt, which is very reasonable for a 3d band. At 103°K, this 
band gives an indication of splitting; this is not surprising, since the 
cation in the rutile structure is not at the center of a regular octahedron 
of anions. 


4.2 TiO (Configuration d*) 


Besides TiO, , the only oxide for which there is fairly clear evidence 
of a 3d band is TiO. This oxide behaves like a metal, indicating the exist- 
ence of a partially filled 83d band. According to Pearson, room-tempera- 
ture conductivity is 3500 ohm—'cm— and increases slowly with decreasing 
temperature. Conductivity also decreases in a ratio of 7:12 when the 
oxygen content is increased from TiOo.7 to TiO... This is to be expected 
if the effect of changing the stoichiometry is to produce an equal change 
in the number of 3d electrons in the 3d band. If all the 3d electrons, 
6 X 10” for stoichiometric TiO, are assumed to be involved in con- 
duction, the electron mobility can be calculated to be» = o/gn = 3500/ 
(1.6 X 10-9 X 6 X 10”) = 0.36 cm?/volt-sec at room temperature. 
This is comparable with the mobility in TiO» and suggests conduction 
in a 3d band in this case also. 


4.3 T1203 (Configuration d!) and V203 (Configuration d*) 


Since these oxides contain, respectively, one and two 3d electrons per 
cation, metallic conduction is expected if a 3d band exists in them, just 
as in the case of TiO. Conductivity data for these oxides are shown in 
Figs. 11 and 12. The outstanding fact about these data is the large dis- 
continuity in conductivity at 7'y for a number of samples. (The lack of a 
discontinuity in some cases will be discussed later.) No other oxides 
have shown such behavior, and it is this behavior which suggests an 
energy band scheme that explains fairly well the information available 
for these oxides. To begin with, one expects the 3d band to be split by 
the crystal field into the dy and de sub-bands. It is proposed, further, 
that the de band may be split by the magnetic exchange interaction into 
a lower filled band and an upper empty band. Thus, in a pure stoichi- 
ometric sample at temperatures below 7’'y , the Fermi level would be 
midway between the occupied de band and the empty de band. When 
the temperature of the sample is raised through T’y , the bands would 
collapse into a single partially filled band and cause metallic conduction. 
Sample A of V.O3 behaves in this way. If it is assumed that carrier 
mobility does not change very much on going through 7'y , and that the 
conductivity change is due only to the change in carrier concentration 
resulting from the collapsing of de bands, the amount of discontinuity 


1064 THE BELL SYSTEM TECHNICAL JOURNAL, JULY 1958 




















CONDUCTIVITY IN OHM7! CM 











1000/TE MPERATURE IN DEGREES KELVIN 


Tig. 11 — Conductivity of Ti,0; through the Néel temperature. (Sample A, 
Foéx and Loriens, Ref. 45; sample B, Pearson, Ref. 32.) 
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Z ee 2 — Conductivity of V2.0; through the Néel temperature. (After Foéx, 
ef. 46. 
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to be expected in the conductivity of a pure stoichiometric sample can 
be estimated. This will be simply a factor of exp (4/kT'y), where [Hp is 
taken to be the slope of the conductivity curve below 7'y . For V2O3 
sample A, this factor is exp (0.2/8.63 X 10-5 X 170) = 8 X 105, which 
is very close to the measured discontinuity. For Ti,O3; sample A, the 
factor is exp (0.11/8.63 X 10-5 X 470) = 15, again very close to the 
observed discontinuity, although this sample does not show metallic 
conduction above 7'y . A remarkable point about these two cases is that 
the Fermi energy is changed at the discontinuity by an amount con- 
siderably greater than the exchange energy k7'y . 

If a sample departs from stoichiometry several effects can be imagined 
which would work to obscure the discontinuity: the odd, or missing, 
cations in the lattice will tend to decouple the spin system; they may 
break up the periodicity of the lattice and disrupt the 3d band; they 
will alter the conductivity by acting as donors or acceptors. Samples 
A and B of Ti.O3 and V20O3 sample B probably show these effects in 
varying degrees. With the band disrupted, transport probably occurs as 
in oxides such as NiO, and metallic conduction does not appear above 
Ty. The presence of donors or acceptors increases the conductivity 
below Ty and minimizes the effect of the magnetic transition on the 
conductivity. 

An estimate of electron mobility for V2O3 sample A above 7'y gives 
uw = o/qn = 60/(8 X 10” X 1.6 X 107°) = 5 X 10°? cm?/volt-sec. 
Thus, mobility in a single crystal may be of the order of 10-*cm?2/volt- 
sec, which is more than an order of magnitude less than in TiO so that 
the 3d band in VO; is probably not much greater than kT in width at 
‘temperatures above T'y . 

It seems reasonable to conclude from the considerations and results 
of this and the previous section that a 3d band exists in the oxides of 
scandium, titanium and vanadium, which probably goes to essentially 
zero Width in chromium oxide. A schematic of this idea is shown in 
Fig. 13. 


V. RELATIVE ENERGIES OF THE 4s, 8d AND 2p LEVELS 


_ We saw in the. last section that, of the 3d oxides, probably only those 
of Se, Ti and V are like ordinary semiconductors in that their charge 
carriers are in energy bands. When we come to Cr.O3; and the remaining 
3d oxides we find an entirely new situation. The 2p and 4s bands are 
quite comparable to the bands in ZnO, for example, and can be repre- 
sented on the usual one-electron energy diagram. However, the energy 
of conduction electrons and holes in the localized 3d levels must be 
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Fig. 13 — Schematic showing how 3d bandwidth goes to zero and effective 
mass, m*/m, of charge carriers increases as 3d wave functions contract with in- 
creasing atomic number. 


represented in a different way, since they are in excited states of the 
cation. . 

It is possible to find, in a very approximate way, the relative energies 
of the unbroadened 4s, 3d and 2p levels in an oxide by using data from 
the Born cycle, and spectroscopic data for the free ions and neglecting 
lattice polarization about an electron or hole. In Fig. 14, NiO is taken as 
an example to show how this is done. 

Charge carriers are produced by the following reactions, in which the. 
electronic states represented are separated by large distances in the 
crystal: 


O--(2p*) -+ Ni++(3d8) > O-(2p) + Nit(3a°), (4) 


Nit++(8d8) + Nit+(8d8) — Nit+(8d’) + Ni+(3d°), (5) 
NI*+*(3d8) — Nitt3d’4s > Nit++(3d") + 4s electron, (6) 


V-Nit++(3d?) + Nit+(3d8) = V-Nit+(3d8) + Nit++(3d7), (7) 
LitNit++(3d") + Nit+(3d8) > LitNi++(3d8) + Nit++(3d). (8) 


These reactions represent: (4) the formation of a hole in the 2p band and 
an electron in the 3d levels; (5) the formation of an electron-hole pair 
in the 3d levels; (6) the excitation of an electron to the 4s band and 
formation of a hole in the 8d levels; (7) the trapping of an electron 
at a lattice vacancy acceptor with the formation of a hole in the 3d 
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Fig. 14 — Lattice of NiO with Nit acceptors near a cation vacancy and a 
substituted Lit, Nit electron and Nit** hole, and a O7 hole in the 2p band. 


levels; and (8) the trapping of an electron at a Lit acceptor site with the 
formation of a hole in the 3d levels. Consider first the energy necessary 
to remove an electron from O-~ and place it on a distant Ni** in the 
lattice, as shown by reaction (4). The total electron affinity of an oxygen 
atom (O going to O-—), determined by the Born cycle, is repulsive and 
equals —7.5 ev. The affinity of O for the first electron is +2.2 ev;%4 
hence the negative affinity for both electrons is due to the affinity for 
the second electron (O- — O-~—), which therefore amounts to —9.7 ev. 
This energy is plotted in Fig. 15 relative to the 2p° level plus a free 
electron on the left of the energy diagram. The ionization potential of 
Nit(8d°) and the other spectroscopic data come from Ref. 28. This 
ionization potential is 18.2 ev, and it is plotted relative to the 3d° level 
in the diagram. (These energies are plotted with reversed signs, since in 
the process being investigated an electron is to be taken from O-~ and 
placed on the Nit, and this is the reverse of the process for which the 
energies are quoted.) When the ions are brought together to form the 
lattice, the potential at the position of a positive ion due to all the other 
ions in the lattice is negative, and that at the position of the negative 
ion it is positive. As a result, the O-~ level is depressed and the Nit* 
level is raised (as shown in the diagram) by the amount of the Madelung 
potential** of NiO, which is 23.8 ev. Thus we find these levels moved to 
—14.1 ev (2p°) and +5.6 ev (8d). The difference in energy of these two 
levels, 19.7 ev, is the energy of reaction (4), and it was arrived at by 
considering only free ion potentials, which are a measure of the energy 
necessary to remove an electron from a free ion out to infinity. In this 
process, the electron would experience the potential of the ion all the 
way to infinity. In the lattice, however, the process is carried out in a 
dielectric, so the energy required for the reaction is reduced by the di- 
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Vig. 15 — Energy level diagram calculated for NiO and showing the energy 
required for the reactions (4) through (7). 


electric constant of NiO. Taking this constant to be about 5, the sepa- 
ration of the two levels is thus reduced to 19.7/5 = 3.9 ev, as shown on 
the diagram to the right in Fig. 15. This energy is probably reduced 
even further by broadening of the 2p levels into a 2p lattice band, so 
that the energy required to produce a free hole in the 2p band and an 
electron in the 3d levels is expected to be less than 3.9 ev, in NiO. 

The next process of interest, given by reaction (5), is to take an electron 
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from a Nit* and place it on another Nit* far removed in the lattice. 
The 3d level is located at —36.2 ev, with respect to the 3d’ level in the 
left of the diagram. Adding to this the Madelung potential raises it to 
—12.4 ev. This lies 18.0 ev below the 3d? level calculated in a similar 
manner in the preceding paragraph. In the lattice, therefore, the transi- 
tion will require 18.0/5 = 3.6 ev. Thus, on the right-hand side of the 
diagram the 3d° level is plotted 3.6 ev below the 3d° level which had 
been located in the first calculation. 

Reaction (6) represents the transition of an electron from a Nit+ into 
the 4s band. The first step in calculating the energy required for this 
process is to take an electron from the ground state, 3d’, to the first 
excited 4s state, given by 3d’4s. The location of this excited state with 
respect to the ground state is plotted to the left of the diagram, at 
— 29.6 ev, and in the center, at —5.8 ev, by adding the Madelung po- 
tential, which places it 6.6 ev above the 3d level. Thus, it required 6.6 
ev to produce this excited state of the cation, which is.a bound hole- 
electron pair. The additional energy required to separate this pair 
(essentially to remove the electron from the field of the cation) is given 
approximately by q?/ea, where ¢ is the dielectric constant and a is the 
distance between cations. This energy amounts to 1.0 ev, so that the 
energy required to free an electron from a cation and place it in a band 
of 4s states is 7.6 ev. This is plotted with respect to the 3d° level at the 
right of the diagram. Due to broadening of the 4s levels into a band, the 
actual separation will be reduced to something less than 7.6 ev. 

Consider now the process (7), in which excess oxygen in the NiO 
lattice introduces vacant Ni**+ lattice sites, which act as acceptors and 
produce conduction holes in the 3d levels. The relative location in energy 
of such an acceptor level has been calculated by de Boer and Verwey,’ 
although they neglected the effect of the dielectric constant of the solid. 
Their argument is as follows. Each vacant Nit site is surrounded by 
twelve Nit ions, and the situation to be considered is that in which two 
of these ions yield an electron each to form Nit** ions that compensate 
for the absence of the Ni** ion. This is shown in Fig. 14. Each Nit** is 
surrounded by a normal lattice minus one Nit+ at a distance av/2/2, 
plus one positive charge (Nit*+) assumed to be at av/2, the greatest 
separation from the first Nit*+. Hence, the potential at one Ni*** site 
amounts to the normal Madelung potential plus 2@/a\/2/2 — 
G/av/2 = 23.8 + 9.8 — 2.4 = 31.2 ev. Thus, the level of this ion is 
raised 7.4 ev above that of the normal 3d° ions, as shown in Fig. 15. 
Considering the dielectric constant, the energy required to transfer an 
electron from a distant Nit+ ion to a Nit*+ next to a vacancy will be 
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7.4/5 = 1.5 ev. This level is plotted to the right in Fig. 15 with respect 
to the 3d’ level. On this same basis, in reaction (8) a substitutional Lit 
will produce a Nit** acceptor level at 0.5 ev above the 3d? level. 

The calculated energies involving four of these reactions are shown by 
vertical arrows to the right of the diagram. They will be used in Sections 
VI and VII as guides in analyzing the data for NiO and Fe,O3. It will 
be seen in Section VI that these calculated energies are probably larger 
than the actual energies found in NiO by a factor of approximately two. 
Because of this, it is suggested that the dielectric constant of NiO is 
nearer to ten than to the value of five assumed in these calculations. 

The Madelung potential is about the same for all the monoxides CuO 
through TiO. However, as we go from CuO to MnO, the second and 
third ionization potentials of the cation become progressively smaller. 
This raises the 3d levels in energy with respect to the 2p band. In MnO, 
for example, the 3d? level is about 4.5 ev above the 2p band, as compared 
with a separation of 3.9 ev for the 3d° level in NiO, (assuming a dielectric 
constant of five in both cases). Therefore, conduction by 2p holes may be 
less important in the oxides which are earlier in the series than is NiO. 

Madelung constants are not available for the individual ions in the 
rhombohedral oxides where anions and cations do not occupy equivalent 
lattice sites. Therefore, the relative positions of the 3d levels and the 2p 
band in these oxides cannot be determined easily. 

The location (as given in Ref. 22) of the 3d’4s state with respect to 
the ground state in the free ion indicates that the 4s band becomes 
farther removed from the 3d levels as the charge on the cation is in- 
creased. Thus, the 4s band probably plays no role in conduction in any 
of the oxides having cations with a charge +2 or greater. The energy 
of the 3d—3d transition is given by the difference in ionization energy 
of the ground states, 3d" and 3d”*!, divided by the dielectric constant 
of the oxide, as shown above for NiO. The only sesquioxide of interest 
here for which spectroscopic data are available is Cr2O; (which is one 
that does not have a 3d band), for which the interval 3d? — 3d?! is 18.6 
ev. This is about the same as 3d° — 3d? in NiO. 

The simple oxides, then, when pure and stoichiometric, can be expected 
to show conduction involving 3d holes and electrons and 2p holes, as a 
result of the processes shown in (4), (5) and (6). These are the processes 
corresponding to intrinsic conduction arising from excitation across the 
forbidden gap in the semiconductors discussed in earlier chapters. They 
will probably appear to be p-type, because conduction by high-mobility 
2p holes will predominate over the conduction by low-mobility electrons 
in the 3d levels. On the other hand, when an oxide is impure or departs 
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from stoichiometry, as in (7) and (8), its type will depend, in addition 
to this intrinsic behavior, upon the particular valence state of the cation 
and upon the valence states available to the cation. It was shown above 
how excess oxygen or substitutional Lit in NiO introduced Ni*+* ac- 
ceptor levels near to the 3d$ levels. In TiO2 , a departure from stoichi- 
ometry produces Tit** ions which are donors with respect to Tit*++ ions 
and introduce donor levels near to the 3d band. In general, when the 
cations are in their lowest valence state, a defect will be such as to cause 
the oxide to be p-type and, when the cations are in their highest valence 
state, a defect will be such as to cause the oxide to be n-type. Oxides such 
as Fe.O3, in which the cations are in an intermediate valence state, 
can be made to go either into n-type or p-type. However, at high tem- 
peratures even impure oxides will tend towards p-type, because of con- 
duction by high mobility 2p holes. The energy levels discussed in Section 
III, of course, exist in addition to those shown in Fig. 15, but they are 
excited states which do not contribute to conduction. 


VI. NICKEL OXIDE, NiO 


Much more information is available for NiO than for the other cubic 
oxides, so it will be considered in detail. The structure of NiO has been 
described in Section II, and a possible electronic energy level scheme has 
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Fig. 16 — The resistivity of NiO as a function of Li content. (Sample o, Morin, 
Ref. 37; sample x, Verwey, Ref. 47.) 
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been discussed in Sections IV and V. Nickel oxide is a p-type semicon- 
ductor, apparently even when pure and stoichiometric, and no n-type 
form has been reported. Room-temperature resistivity for pure NiO 
is approximately 10 ohm-cm. This can be lowered to about 1 ohm-cm 
by the addition of Li, as shown in Fig. 16. 

The transport data available for NiO are from Wright and Andrews,** 
taken on oxidized pure Ni films, and Morin,®’ taken on sintered bars 
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Fig. 17 — The conductivity of NiO (samples 1, B,; and E) and of NiO contain- 
ing Li (samples 4, 5 and 6; see Fig. 16). (Samples 1, 4, 5 and 6, Ref. 37; samples 
Bi and E, Ref. 36.) 
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containing different amounts of Li and also presumed to vary in oxygen 
content. The data are shown in Figs. 17, 18 and 19. Samples 1 and B, , 
judging from the method of preparation (see Refs. 36 and 37), their 
color (pale green) and their electrical behavior, are considered to be the 
purest and most nearly stoichiometric. Conductivity and thermoelectric 
effect. of the other samples approaches that for these at high tempera- 
tures. This occurs in a way typical of semiconductors when impurity 
centers become completely ionized and intrinsic conduction predomi- 
nates. In the intrinsic range, both Hall effect and thermoelectric effect 
remain p-type. 

According to the calculations of Section V, the energy level diagram, 
Fig. 20 can be constructed. Three parallel conduction processes are 
expected for NiO, from reactions (4) and (5): o1, high-mobility 2p 
holes; o2 , low-mobility 3d holes; and o3, low-mobility 3d electrons, so 
that total conductivity is given by 


6 =01+ 62+ 03. (9) 


Carrier concentration will be given, choosing /, = 0 as the zero of energy, 
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Fig. 18 — The Seebeck effect of NiO (samples 1, Bi , and ) and NiO contain- 
ing Li (samples 4, 5 and 6; see Fig. 16). (Samples 1, 4, 5 and 6, Ref. 37; samples 
Bi and E, Ref. 36.) 
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by 
* 3/2 
pi = o( Zener) exp (— E,/kT), (10) 
po = N exp[— (Ep — E2)/kT), (11) 
ns = N exp [— (4; — E,)/kT\, (12) 


where N is the number of cations per cu cm. No Hall effect is expected 
from carriers p2 and 7; in localized 3d levels, and none has been observed. 
Under conditions of sufficiently high purity and stoichiometry, or at 
elevated temperature where an appreciable fraction of the current is 
carried by 2p holes, a Hall effect due to these holes is expected. Since 
such a Hall effect is due only to 2p holes, but the current is carried by 
three kinds of carriers, the Hall coefficient is given by 
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Fig. 19 — The Hall coefficient of NiO. (Samples Bi: and E, Ref. 36.) 
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Fig. 20 — Energy level diagram proposed for NiO. 


Because the 3d carriers give no Hall effect, their behavior has been 
studied using another tool, the Seebeck effect. In interpreting the See- 
beck effect of the 3d oxides it is assumed that 3d carriers transport no 
kinetic energy, since they spend most of their time trapped on lattice 
sites. Therefore, there is no phonon drag effect associated with them, 
and their Seebeck effect is due only to their separation in energy from 
the Fermi level. For simplicity, it is assumed that there is no phonon 
drag effect associated with 2p holes. This assumption is probably a good 
one, since most of the data have been taken at elevated temperatures 
where this effect is at a minimum. The 2p holes are assumed to carry a 
kinetic energy of 247’. The over-all thermoelectric effect produced by 
three kinds of carriers is given by 


QTo = QiTo1 + QeTo2 — Q3To3 (14) 
or 
QTo = (Ep — Ey +- 2kT )o1 -+- (Er = E2)o2 -— (E's = Ey)o3 ‘ (15) 


The problem is to determine three carrier concentrations and three 
mobilities from only three independent experiments. This complicated 
calculation will not be attempted here. Rather, two extreme situations 
will now be assumed: (a) NiO doped with so much Li that only 3d 
holes contribute to conduction, and (b) NiO so pure that 2p holes pre- 
dominate in conduction. 

In Fig. 16 it is shown that the conductivity changes in direct propor- 
tion to the amount of Li added, for high concentrations of Li. One infers 
from this that each Li added contributes one hole to conduction and that 
nearly all the impurity centers are ionized. Thus, conduction is entirely 
extrinsic and due to 3d holes. Under these simplified conditions, (9) 
and (14) reduce to 


o = 62 (16) 
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and 
QT = (Ly — Ly). | (17) 


Knowing now, from (17), the location of the Fermi level with respect to 
the 3d° levels, the concentration, p2 , of 3d holes can be calculated from 
equation (11) if the density of states, N, available to the holes is known. 
This is assumed to be the number of cations per cu cm of crystal, which is 
5.6 X 10” in NiO, since we are dealing with holes localized on the cations 
rather than in an energy band. 

Now, with the measured o and the hole concentration calculated in 
this way from QT for sample 6, the mobility, us , of the 3d’ holes has been 
calculated and is plotted in Fig. 21. Both in magnitude and in tempera- 
ture dependence this result is in accord with the ideas concerning trans- 
port in highly localized 3d orbitals discussed in Sections I and VIII. 
Mobility at 300°K is 0.004 cm?2/volt-sec and increases exponentially with 
temperature, with an activation energy of 0.10 ev. At low temperatures, 
QT (sample 6) is constant with temperature, at 0.10 ev. If the Fermi 
level lies halfway between the 3d’ levels and the acceptor states, this 
would indicate then that the [Lit Ni+**] acceptor levels are 0.20 ev 
above the 3d’ levels in this sample, as compared with the value of 0.5 
ev calculated in Section V and shown in Fig. 15. This suggests that a some- 
what higher value should have been closen for the dielectric constant 
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Fig. 21 — The mobility of holes in the 3d levels of NiO obtained from measure- 


pent of conductivity and Seebeck effect. (See Fig. 16 for composition of samples 
5 and 6.) 
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in these calculations. On the other hand, at the high density of defects 
existing in sample 6, interaction among defects may occur, and this 
would reduce their ionization energy. 

A calculation of pe and we has been made for sample 5, in which the 
[Lit] is not in the range where oz is directly proportional to [Lit] (see 
Fig. 16). The mobility results are shown asa dashed line in Fig. 21. At 
temperatures for which 1/7’ is >1.5 & 107° they are parallel to, but 
much lower than, the results for sample 6; at higher temperatures they 
rise rapidly with temperature. It is assumed that transport of holes in 
the 3d° levels consists of Ni** states drifting by jumps from one cation to 
a neighbor cation under the influence of the applied electric field. There- 
fore, it seems likely that mobility for this type of transport will be 
independent of impurity concentrations. In fact, this is suggested in 
Fig. 16 by the direct proportionality between conductivity and Li con- 
centration for high-Li concentration. Consequently, the results on sample 
5 probably mean that conduction in the 2p band is appreciable in this 
sample and cannot be neglected in any sample containing less than a 
3 X 107° atom fraction of Li (or approximately 2 X 107° Li atoms/ 
em’). This suggests, further, that the 2p band is not much farther below 
Ey than are the 3d levels, in accord with the energy level diagram in 
Fig. 15. 

Now assume the second situation, pure NiO, with only 2p holes in- 
volved in conduction and yu: = Ryo. This result for sample / is shown 
in Fig. 22. Also, from this assumption (15) reduces to 


QT = (Ey + 2kT). (18) 


From this, p: can be calculated from (10), assuming m*/m = 1. Since 
o1 = G6, wi can again be calculated. This has been done for samples 1 
and By, and is also plotted in Fig. 22. There, the yu: results are compared 
with y,, a lattice-scattering mobility considered reasonable for carriers 
in the wide band of an oxide. The high values found for yw: support the 
idea that 2p holes are involved in the conduction. The temperature 
dependence that is found is quite far from a reasonable one, showing that 
the other conduction processes also play a role in these samples and 
cannot be neglected. These results, then, do not represent true 2p hole 
mobility. They do indicate, however, that the energy level scheme pro- 
posed for NiO is a reasonable one. With this model in mind, the QT 
results of Fig. 18 can be understood in a qualitative way. At low tempera- 
ture, Hp is midway between FH, and E£4, the acceptor levels, so that 
g = 06; + oy. As the temperature is raised, #y moves upward through 
EH, as acceptors become completely ionized, and QT increases accordingly. 
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Fig. 22 — The mobility of holes in the 2p band of stoichiometric NiO obtained 
from measurement of conductivity Hall effect and Seebeck effect. 


Finally, at high temperatures, all the acceptors become ionized in the 
purest samples, Hr lies nearly midway between EF, and £;, and QT' be- 
comes constant with temperature. Under these conditions, QeTo2, — 
Q3;7T'o3 approaches zero, and Q7' is p-type, due largely to 2p holes. At 
high temperature, measured QT for samples 1 and Bi is about 1 
ev. Furthermore, the slope of the conductivity and Hall coefficient 
plots for these two samples is also about 1.0 ev at high temperature. 
These results suggest that the 2p band is about 1.0 ev below the Fermi 
level in the intrinsic situation and, therefore, that the energy levels of 
Fig. 15 should be scaled down by a factor of one-half. This is also in 
fair agreement with the finding that the [Lit Nit**] acceptor level lies 
0.2 ev above the 3d level rather than 0.5 ev above, as was calculated. 
Evidently the value of ten would have been a better assumption for the 
dielectric constant of NiO. 


VII. IRON OXIDE, Fe.03 


Magnetic and structural properties of the rhombohedral sesquioxide 
of iron have been described in Section IT. Its semiconductor behavior is 
very similar to that of NiO, except that FeO; can exist as either n- or 
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p-type because the cation can take on a lower as well as a higher valence 
state. Oxidation causes pure Fe.O3 to become p-type, probably due to 
the production of Fe**** ions in the vicinity of cation vacancies. Re- 
duction causes pure FeO; to become n-type, probably due to the pro- 
duction of excess cations in the form of Fet* ions. With respect to the 
normal lattice of Fet+* ions, Fet*** is a hole and Fet an electron. 
The room-temperature resistivity of Fe,03 is approximately 10!% ohm-cm. 
This can be lowered to about 4 ohm-em by the addition of Ti. It is known 
from an X-ray study® that, in the isomorphous compound FeTiO;, 
the cations are Fett and Tit*** at room temperature. Presumably 
then, Ti+** in Fe.03; also ionizes, to give a 3d electron (Fet*) and a 
Tit jon. The change in resistivity with added Ti is shown in Fig. 23. 
At high Ti concentrations the resistivity does not decrease with added 
Ti. This may be due to a shifting oxygen equilibrium with Ti content, 
which introduces acceptors that compensate for some of the donors. 

Transport data!® *° available for Fe203 have been obtained on sintered 
bars, and consist of conductivity, Fig. 24, and thermoelectric effect, Fig. 
25. Samples 1 and A are identical in conductivity, but the thermo- 
electric effect shows that, due to the different conditions of sintering, 
sample 1 is p-type and sample A is n-type. The QT data also show that 
both of these samples tend to be p-type at high temperatures, which is 
in accord with (14) and the idea that the general energy level scheme 
which was developed in Section V for NiO is probably applicable to 
most of the simple oxides. However, in Fe.03, QT is smaller in the in- 
trinsic range than it is in NiO. This suggests that the 3d levels in Fe.O; 
are higher above the 2p band than they are in NiO. 
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Fig. 23 — The resistivity of Fe,O; as a function of Ti content. (Ref. 18). 
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Fig. 24 — The conductivity of Fe:0; (samples 1, A and 2) and Fe,O; containing 
Ti (samples 3 and 4; see Fig. 23 for composition). 


The situation in Fe.O; , where both donors and acceptors are present, 
is somewhat more complicated than in NiO. However, the simple case 
in which Fe,O; is so heavily doped with Ti that conduction is only by 3d 
electrons can be dealt with. Such a situation is probably represented by 
sample 4, in which 


oC = 03, (19) 
QT = (Ls a Er), (20) 


and n; is given by (12), where N = 4.0 X 10” cm. From these equa- 
tions and the data for sample 4, u3 has been determined. It is found to 
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Fig. 25 — The Seebeck effect of Fe2O; (samples 1, A and 2) and Fe.0; contain- 
ing Ti (samples 3 and 4; see Fig. 23 for composition). 


be identical with the hole mobility for NiO sample 6, shown in Fig. 21. 
Thus the mobility of 3d electrons in Fe2Q; is in accord, both in magni- 
tude and temperature dependence, with the idea of transport in localized 
3d levels. At T = 300°K, ws = 0.004 cm?/volt-sec and increases ex- 
ponentially with temperature with an activation energy of 0.1 ev. 


VIII. TRANSPORT IN THE 3d LEVELS 


In Section V the processes were described by which conduction elec- 
trons or holes are created in the 3d levels. In NiO, for example, a 3d 
hole freed from a cation defect is presumed to consist of a Nit** with 
the cation neighbors for some distance around it normal Nit* ions. The 
hole can diffuse about the cation lattice by exchanging an electron with 
a Ni** neighbor until it moves next to an ionized defect site, receives an 
electron and becomes trapped, or until it becomes annihilated by re- 
combination with a free electron. A net drift of such holes occurs when 
an external electric field is applied to the crystal, and thus an electric 
current is obtained. 

In Sections VI and VII the mobility of 3d charge carriers was shown 
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to be very low and to increase exponentially with temperature. These 
results are consistent with a suggestion made 20 years ago by de Boer 
and Verwey’ that conduction in NiO and Fe.O; occurs as outlined above 
and that a potential barrier exists which must be overcome for a charge 
carrier to jump from one cation to another. The potential barrier is 
assumed to arise because the charge carrier spends a time on a par- 
ticular cation which is long compared to the vibrational frequency of 
the lattice, and this allows the ions surrounding a charge carrier to be- 
come displaced by its electric field. Evidence that this occurs in FeO; 
containing Ti was given in Section III. This displacement creates a field 
which tends to prevent the charge carrier from leaving its lattice site, 
and so the charge carrier has trapped itself.*?: 44 These ideas can be repre- 
sented by two energy levels, one for a hole, the other for a normal cation 
neighbor, being moved up and down in energy by phonons. Whenever 
~ the two levels become the same, charge transfer can occur; thus, an 
exponential temperature dependence would be found for transport. 

One might expect this charge transfer to depend upon the electron 
configuration of the cation. Using the diagram in Fig. 6 one can see 
that, if Hund’s rule* is followed, a hole in MnO and CuO will be in the 
dy levels but a hole in FeO, CoO and NiO will be in the de levels. Now, 
since the de orbitals are directed between anions and towards neighboring 
cations while dy orbitals are directed at anions, transport in the two 
levels may be quite different. For example, transport in the dy levels, 
through an anion as in superexchange, will involve spin reversal below 
Ty regardless of crystal orientation. Transport in the de levels will not 
involve spin reversal if it is in the (111) planes of parallel spins but will 
require spin reversal if it is perpendicular in the planes of parallel spin, 
and it so may produce an anisotropy in conductivity, at least within a 
magnetic domain. 

If spin reversal is involved in transport, one might expect the activa- 
tion energy for transport below 7'y to be the sum of the polarization 
energy plus the exchange energy, and to be only the polarization energy 
above Ty . This would introduce an abrupt change in the temperature 
dependence of conductivity at Ty . Heikes and Johnston” believe that 
they have seen this effect in sintered samples of the oxides MnO, CoO, 
NiO and CuO which contained a large concentration of Li. However, 
the results described in Section VI for NiO containing Li (sample 6) 
show no change in the slope of mobility (Fig. 20) at Ty , but do show 


* That the d levels are filled with electrons in such a way as always to main- 
tain a maximum spin multiplicity. 
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that the slope of conductivity is changing in this temperature region, 
because the Fermi level (Fig. 18) is beginning to move upward. It is evi- 
dent that these questions can be answered only by transport studies on 
single crystals. 
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